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Problems and Techniques of Text Analysis 

The Text Analysis 

The goal of the Text Analysis Group of the Center for the Study of
Reading is to investigate the problem of reading comprehension from the
standpoint of properties of texts, in order to understand the contribution
of texts to difficulty or ease of reading, and to construct appropriate
theories to account for text properties. The need for this kind of research
is clear. Given a solid understanding of these matters, texts could be
more effectively evaluated for quality and for level of difficulty; this
would allow careful matching of the text to the needs of the child, and it
would probably be possible to construct more accurate diagnostic tests of
reading proficiency. A more complete understanding of text properties
and their role in reading comprehension would no doubt serve as a basis for
instruction, as well.

It is important to keep in mind, of course, that difficulty is not
necessarily a bad thing. A gross method of detecting difficulty level would
be nearly worthless if it could not distinguish between a text that was
well-constructed but made challenging demands on the reader, and one that
was so poorly constructed that it contained unnecessary obstacles to
comprehension. Children, like adults, can learn by being challenged; but a
poorly constructed text can not only obstruct learning but can confuse
or discourage a less-than-proficient reader. It is by no means obvious
just what it is that makes a text a challenge or an obstacle. We have seen
texts that had been modified for poor readers by abridging and modifying
longer texts, that impressed us as harder, not easier, than the original.
We suspect that these are not isolated instances. For example, it is likely
that in forcing a complex story or exposition onto the Procrustean bed of
small vocabulary, short sentence length, and short text length, one requires
the reader to make more complex inferences to reconstruct the intended
information structure of the text. Consequently, it is essential to develop
tools not only for evaluating the difficulty level, but also for
distinguishing challenging from confusing texts.

Ideally, then, it would be possible to examine a given text using
simple replicable methods of evaluation to determine not only the level of
difficulty, but in fact the sources of difficulty, in a given text. But
such methods simply do not now exist. The state of the art in such matters,
in particular theoretical work on text properties in linguistics, psychology,
and artificial intelligence, just has not reached a state that affords any
kind of non-subjective methods of analysis for any but the simplest kind of
text properties, like lexical frequency counts and measures of sentence
length.

The main purpose of the research of the Text Analysis Group is to
advance the state of theoretical work on text properties. This research
bears on that goal in three ways: first, it allows the sharpening of the
tools of analysis, and a determination of which methods of analysis are
likely to be usable and fruitful, albeit subjective and intuitive. As an
example, we have not included given-new or topic-comment analyses of the
texts, since the methods of analysis we have attempted to use turn out to
be either impossible to apply to non-trivial texts, or when applied
consistently, lead to intuitively absurd results. Such an outcome shows
the poverty of the theories with which such methods are associated, and
calls for further theoretical work on these matters.

Second, this work is exploratory, in that close analysis of texts for
several kinds of properties is likely to reveal new classes of phenomena,
and to suggest research on their role in text comprehension.

Third, we are convinced that the state of theoretical work on text
properties can only be advanced beyond its present state by consideration
of data bases considerably richer than presently available. Most
theoretical discussions of text properties are based on single texts, often
of trivial size. We intend to remedy the situation by collecting a
significant body of analyzed texts, with which to compare developing
theories of text properties. The research in this report is a step toward
that goal. This kind of analysis will be continued, and we are developing
systems for annotating, storing, and retrieving analyses of texts.

As a consequence of the exploratory nature of the work in this report,
parts of it are uninterpreted at present. For example, one cannot know what
to make of the section on the relation between syntactic and line-end
properties until similar analyses can be compiled for a number of texts, and
experimental work is done to determine the effects of the differences.

Lexical Analysis

Statistical Lexical Analysis

The statistical analysis consists of a word frequency count of the
words in the text. The first part of this section describes the way the
frequency counts were calculated. The second part of this section presents
an alternative to statistical analyses which treat each orthographic word
as a separate unit. This analysis suggests that a text may be analyzed by
semantic lexical units, which are semantic units that the reader must
comprehend, and which have the property of being psychologically real to the
reader of a text. This section also contains discussion of a number of
questions concerning the utility of simple word frequency counts.

Statistical lexical analysis requires decisions about what counts as
one word (e.g., are contracted forms like don't one word or two?) and about
what counts as instances of the same word for the purposes of frequency
counts (e.g., are taxi and taxis the same word?). In our analyses,
contractions were treated as containing two words, and words related by
inflectional morphology (e.g., taxi and taxis; be, am, is, are, was, and
were) were treated as instances of the same word.

In the frequency counts, each sequence of letters delimited by either
spaces or apostrophes was treated as a separate word. That is, no matter
what the semantic relationship of a lexical item to other words in the
sentence, it was considered independently of any semantic unit containing
it. For example, the word for was separated from waiting for, a semantic
unit in which it is contained, and counted as a separate lexical item.
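
The counting rules just described can be sketched in a few lines of
Python. This is only an illustration under stated assumptions: the lemma
table LEMMAS is a hypothetical stand-in for a full morphological lexicon,
and treating every non-letter character (not just spaces and apostrophes)
as a delimiter is a simplification.

```python
import re
from collections import Counter

# Hypothetical lemma table covering only the inflected forms used below;
# a real analysis would need a full morphological lexicon.
LEMMAS = {"taxis": "taxi", "am": "be", "is": "be", "are": "be",
          "was": "be", "were": "be"}


def tokenize(text):
    """Return every run of letters as one word.

    Spaces and apostrophes (and, as a simplification, all other
    non-letter characters) act as delimiters, so "don't" yields two words.
    """
    return re.findall(r"[a-z]+", text.lower())


def frequency_count(text):
    """Count word forms after merging inflectional variants."""
    return Counter(LEMMAS.get(word, word) for word in tokenize(text))


sample = "The Babar family is waiting for a taxi. The taxis are late."
counts = frequency_count(sample)
print(counts.most_common(3))
```

Note that for is counted on its own here, exactly as in the procedure
above: nothing in the count records that it belongs to waiting for.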

This section treats the utility of viewing the discourse as consisting
of more semantically integrated lexical units. There is no known replicable
procedure for dividing the sentences into such units; linguists debate such
issues constantly. Nonetheless, it seems that certain kinds of semantic
analyses of the words in the discourse may have some relevance to the study
of difficulty of texts.



Our primary concern is that frequency counts of words are counts of
form. If frequency counts and counts of average number of words per
sentence do correlate with difficulty of a text, these statistical measures
must only be symptomatic of deeper properties of texts. Below we discuss
some examples from BABAR and DESERT to show the kinds of considerations
that might go into a semantic lexical analysis. In the course of this
discussion, some of the problems are pointed out which are connected with
simple word form frequency counts.

One interesting possibility which the semantic lexical analysis affords
that the simple word count does not is that of lumping together units that
have the same reference. So, for pronouns, it is possible to separate the
he's that refer, for example, to Babar, e.g.,

     S3-3k  But Babar can't eat. He is thinking about his crown.

from those he's and him's that refer, for example, to Arthur and the
Mustache-man, e.g.,

     58-63  Arthur climbs out of the water just as fast as he can. He
            is all wet. He sees the man with the bag. Arthur calls to
            him. But the man does not hear him. He is walking away.

It is then possible to count separately the different kinds of reference
expressions used for the respective referents. Perhaps there will be
correlations between the number of different types of reference to the same
referent and the difficulty of the material under study. That is, it may
be useful to look at the various lexical descriptions, pronominal references,
various instances of definite reference, e.g., the man, the man with Babar's
bag, the Mustache-man, and references with demonstratives, e.g., that
Mustache-man. A word frequency count will not give us this kind of
information.
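
As a rough illustration of counting reference expressions per referent,
the sketch below tallies hand-annotated mentions from BABAR lines 58-63.
The annotations, the referent labels, and the three type names are our
own assumptions for the example; only mentions of Arthur and the
Mustache-man are annotated.

```python
from collections import Counter, defaultdict

# Hand-annotated mentions: (expression, referent, type of reference).
# Referent labels and type names are illustrative assumptions.
MENTIONS = [
    ("Arthur", "Arthur", "proper name"),
    ("he", "Arthur", "pronoun"),
    ("He", "Arthur", "pronoun"),
    ("He", "Arthur", "pronoun"),
    ("the man with the bag", "Mustache-man", "definite description"),
    ("Arthur", "Arthur", "proper name"),
    ("him", "Mustache-man", "pronoun"),
    ("the man", "Mustache-man", "definite description"),
    ("him", "Arthur", "pronoun"),
    ("He", "Mustache-man", "pronoun"),
]


def reference_profile(mentions):
    """Count, for each referent, how often each type of reference is used."""
    profile = defaultdict(Counter)
    for _expression, referent, kind in mentions:
        profile[referent][kind] += 1
    return profile


profile = reference_profile(MENTIONS)
for referent, kinds in profile.items():
    print(referent, dict(kinds))
```

A plain word count over the same lines would report only totals for he
and him, with no indication of which referent each one picks out.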

Word frequency counts also make no distinction between content words
and homophonous words that serve syntactic functions. For example, in

     32  Do you think that all deserts are sandy and flat?

from DESERT, the word that is a subordination marker, serving a syntactic
function, and should be identified as such. Further, it should not be
confused with demonstrative that as in BABAR

     33  "We'll find that Mustache-man,"

where the demonstrative does have semantic content.

Word frequency counts likewise confound the so-called "weather it," as
in DESERT,

     8  In the desert it may not rain for five years,

where the phrase it...rain may well be taken as a single unit, with
anaphoric non-referential it's as in

     14-15  A man lost in the desert knows his camel will help him. It
            will find a water hole.

and anaphoric referential it's, as in (from BABAR)

     30-31  "I need my crown!" says Babar. "I must wear it tonight!"

It is not obvious that lumping these it's together is justified.
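
The difference between counting forms and counting forms by function can
be made concrete with a small sketch. The function labels below are
hand-assigned to the examples quoted above and are not an established tag
set; they are assumptions made for illustration.

```python
from collections import Counter

# (word form, hand-assigned function) for the quoted examples.
TAGGED = [
    ("that", "subordinator"),             # DESERT: "think that all deserts"
    ("that", "demonstrative"),            # BABAR: "that Mustache-man"
    ("it", "weather"),                    # DESERT: "it may not rain"
    ("it", "anaphoric non-referential"),  # DESERT: the camel example
    ("it", "anaphoric referential"),      # BABAR: "I must wear it"
]

by_form = Counter(form for form, _function in TAGGED)
by_function = Counter(TAGGED)

print(by_form)      # two entries: one per orthographic form
print(by_function)  # five entries: one per form-function pair
```

The first count is what a simple frequency list reports; the second keeps
apart uses that may differ in their demands on the reader.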

There is still a great need for linguistic research to develop reliable
criteria for isolating multi-word semantic units. With such criteria it
would be possible to perform more refined statistical analyses, and to
determine, for example, whether writers or texts vary significantly in the
degree of coincidence between orthographic and semantic units, and how
this affects ease of comprehension.



Semantic Lexical Analysis 

The opening sequence of a book in a beginning reading series, Babar
Loses his Crown, tells about the Babar family travelling to Paris.

     The Babar family is going to Paris.
     Now the Babar family is on the train.
     The train is coming into Paris....
     Now they are off the train
     with all their bags.
     The Babar family is waiting
     for a taxi.
     The taxi takes them to their hotel.
     Celeste and the children walk inside....
     In their hotel room....

A fair amount of space is devoted to the journey. The trip could have
been summed up more briefly, as in

     The Babar family took a train to Paris, taxied to their hotel and
     went inside. In their hotel room...

or even

     The Babar family travelled to Paris. In their hotel room,...

The sequence of locations, pathways, and methods of transportation can be
presented in a detailed fashion, with the states and events spelled out, or
can be abridged and compressed into one verb (e.g., travelled). How do these
choices affect comprehension?

Stories tell about happenings: about actions, events, processes, states,
relationships. In this section we discuss the semantics of verbs, which
convey much of this information. First, we describe a notational system in
which verb meaning distinctions can be expressed and discuss its
psychological interpretation. Then, this system is applied to the sample
text above. Next, we discuss the ways in which verb semantic structure may
affect comprehension, and describe some research in the area. Finally, we
suggest some lines along which further research might proceed.

1. Representation of Verb Meaning

There are many formal systems for representation of verb meaning. These
models of verb meaning differ from one another in detail, but there is
widespread agreement on the idea that verb meanings can be represented in
terms of interrelated sets of subpredicates, such as CAUSE or CHANGE. These
subpredicates express the inferences that are normally made when the verb is
used. These inferences are highly interrelated, and the representation of
verb meaning must indicate the relationships among the sets of subpredicates.
The notation used here is a network format developed by the LNR Research
Group. In this representation system, both verb meanings and events are
expressed in the same terms: as states, changes of state, actionals,
etc.

2. The Elements of Verb Meaning 

When people talk about happenings in the world, they distinguish several
types of conceptual possibilities. The simplest kind of relationship is the
state. A stative predicate conveys a relationship that endures in time
between two arguments, normally an object (or person) and a value within
the conceptual field specified by the stative. For example, consider the
sentence shown in Figure 1.

Figure 1. Ida owned a Cadillac from September to March. [Network diagram
not reproduced; surviving labels: subject, Ida, Cadillac, September, March.]



The verb own conveys that a relationship of possession existed between
Ida and the Cadillac for some duration. Here the state is one of
possession. A large number of such states, including location (to be at, to
remain at, etc.) and various emotions (to hate, to love, etc.) can appear
in verb meaning. Stative concepts are expressed not only by verbs, but
also by prepositions and adjectives.

In addition to simple stative relationships, verbs, unlike adjectives
and prepositions, can be used to convey changes of state. Following Chafe
(1970), we will refer to this kind of change of state as a process. For
example, the sentence

     Sam received a rose.

tells us that a change of possession occurred such that an earlier state
in which a rose was possessed by some unknown person changed to a state
in which Sam possessed the rose. This is shown in Figure 2.

Figure 2. Sam received a rose. [Network diagram not reproduced.]
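
One way to make the state and process notions concrete is as two small
record types. This is a sketch only, not the LNR network notation: the
class names, the field names, and the choice of which argument is the
"object" and which the "value" are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class State:
    """A stative relation between an object and a value over a time span."""
    field: str                    # conceptual field, e.g. "possession"
    obj: str                      # the object (or person) the state is about
    value: Optional[str]          # value within the field; None if unknown
    start: Optional[str] = None   # initial time, if given
    end: Optional[str] = None     # final time, if given


@dataclass(frozen=True)
class Change:
    """A process: a change from one state to another."""
    before: State
    after: State

    def __post_init__(self):
        same_field = self.before.field == self.after.field
        same_obj = self.before.obj == self.after.obj
        if not (same_field and same_obj):
            raise ValueError("both states must share object and field")


# Ida owned a Cadillac from September to March.
own = State("possession", obj="Cadillac", value="Ida",
            start="September", end="March")

# Sam received a rose (the earlier possessor is unknown).
receive = Change(
    before=State("possession", obj="rose", value=None),
    after=State("possession", obj="rose", value="Sam"),
)
print(receive.after.value)
```

The unknown earlier possessor in the second sentence is carried as an
empty value rather than being dropped, which mirrors the inference the
text attributes to received.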



More commonly, verbs express not simple changes of state but causal
changes of state. An agent may cause a change of state that relates to
another object, or the agent himself (or herself) may be the experiencer
of the change of state. The locational verb move can be used in either
way, as in the following examples:

     a. Ida moved the car.
     b. Ida moved to the front seat.

The representations of these sentences are given in Figures 3a and 3b.

Figures 3a and 3b. Ida moved the car; Ida moved to the front seat.
[Network diagrams not reproduced.]

Notice that in both these cases the precise activity engaged in by
Ida is unspecified. However, there are also a great many English verbs in
which the causal action is partially or wholly specified, for example,
among the location verbs: walk, saunter, meander, stride, run, sprint,
race, trot, jog. (See Miller (1972) for a more extensive discussion of the
verbs of location.) At present we know of no formal representational
notation capable of dealing with physical actions. Miller (1972) and Miller
and Johnson-Laird (1976) have discussed the verbs of motion, and Greeno
and Gentner have developed some tentative representations of some verbs
of mixing (Gentner, 1978). But neither of these attempts goes far
enough to capture the richness of physical knowledge that we use in
distinguishing, for example, sauntering from striding. Figure 4 shows the
representations of the verbs mix and stir developed by Greeno and Gentner.

Stir is a pure actional verb. There is no specification of the change of
state that should occur. Only the action is specified. Stirring may be
done in order to change homogeneity, as when stirring sugar in coffee, or
it may be done to change temperature, as when stirring too-hot black coffee,
or it may be done to prevent something from burning. Mixing is just the
opposite. Here the process (change of state) is specified to be a change
of homogeneity. The action by which this change is produced can be stirring,
shaking, using a blender, or whatever else works.

An event can be a change of state, a causal change of state, an
action, or a concatenation of events. A detailed discussion of the notions
of states and events as they are used in the LNR system is given by Rumelhart
and Norman (1975). Further discussion of verb semantics can be found in the
articles by Abrahamson (1975), Gentner (1975), Munro (1975), and Rumelhart
and Levin (1975) in the same volume.
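
The contrast between stir, mix, and move can be summarized by recording,
for each verb, which part of a causal event it specifies. The sketch
below is an assumed simplification of the prose description, not the
Greeno and Gentner representation; the entries and category names are
hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class VerbSpec:
    """Which parts of a causal event a verb specifies (None = left open)."""
    action: Optional[str]
    change: Optional[str]


# Hypothetical entries following the prose description above.
VERBS = {
    "stir": VerbSpec(action="stirring motion", change=None),
    "mix": VerbSpec(action=None, change="homogeneity"),
    "move": VerbSpec(action=None, change="location"),
}


def classify(verb):
    """Name the kind of verb according to what its entry specifies."""
    spec = VERBS[verb]
    if spec.action and spec.change:
        return "action and change both specified"
    if spec.action:
        return "pure actional"
    if spec.change:
        return "change specified, action open"
    return "unspecified"


for verb in VERBS:
    print(verb, "->", classify(verb))
```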

3. Psychological Implications 

The intention in writing out verb representations is to capture the
set of immediate inferences that people normally make when they hear or
read a sentence containing the verb. The system is decompositional, in
that it is assumed that these networks of meaning components are substituted
for the verbs during comprehension. This substitution process is assumed
to be immediate and largely automatic, and it is assumed that the set of
components associated with a given word is reasonably stable across tasks
and contexts.

The representations should satisfy some psychological criteria. First,
they should accurately capture people's internal meanings. Thus, we should
find that the representations agree with people's intuitive notions of
synonymity and similarity in meaning (see Rumelhart and Norman, 1975). One
measure of this overlap is the likelihood that people will confuse words
in memory. In an experiment in sentence memory, using verbs varying in
semantic overlap, Gentner found that subjects did indeed confuse the verbs
in exactly the way predicted by the theory (Gentner, 1975). The
correlation between the number of confusions subjects made between two
verbs and the semantic overlap between the verbs, as predicted from the
representations, was quite high.

In fact, the correlation between representational overlap and number
of confusions was slightly higher, though not significantly so, than the
correlation between the number of confusions and the rated similarity
between the verbs (the similarity ratings were generated by a different
set of subjects).
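
To illustrate what "semantic overlap as predicted from the
representations" could mean operationally, the sketch below computes the
proportion of shared components for pairs of possession verbs. Both the
component sets and the choice of the Jaccard measure are assumptions made
here; the source does not state how overlap was scored, and no confusion
data are reproduced.

```python
from itertools import combinations

# Hypothetical component sets, loosely following the possession verbs
# discussed in the text; the actual networks are richer.
COMPONENTS = {
    "give": {"CAUSE", "CHANGE", "POSS", "AGENT=SOURCE"},
    "take": {"CAUSE", "CHANGE", "POSS", "AGENT=GOAL"},
    "sell": {"CAUSE", "CHANGE", "POSS", "AGENT=SOURCE", "MONEY", "CONTRACT"},
    "buy": {"CAUSE", "CHANGE", "POSS", "AGENT=GOAL", "MONEY", "CONTRACT"},
}


def overlap(verb_a, verb_b):
    """Jaccard overlap: shared components / components of either verb."""
    a, b = COMPONENTS[verb_a], COMPONENTS[verb_b]
    return len(a & b) / len(a | b)


# Rank verb pairs from most to least overlapping.
pairs = sorted(combinations(COMPONENTS, 2),
               key=lambda pair: overlap(*pair), reverse=True)
for verb_a, verb_b in pairs:
    print(f"{verb_a}/{verb_b}: {overlap(verb_a, verb_b):.2f}")
```

Under a theory of this kind, pairs near the top of the ranking are the
ones predicted to be confused most often in memory.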

Another psychological criterion is that the representations should
be at the correct level of completeness. That is, the representational
structure for a given verb should show the almost-inevitable inferences
that are made when a verb is used, but should not show extremely rare
inferences that can be made in only a few contexts. These must be derived
from the interaction between the representation of the verb and that of
the contexts. This brings us to another requirement.

Another important psychological requirement is upward-compatibility.
The basic notions of state, change of state, cause, and so on must be
combinable into networks larger than the individual sentence. The
inferences derived from words decomposing into semantic components must
interact with information from the context. Sometimes some of the bottom-up
inferences, represented by subpredicates, will be overridden by top-down
contextual information, as in metaphorical uses of verbs. But, although
high-level inferences and context-based expectations are undoubtedly an
important part of meaning processing, still this top-down knowledge must
itself be based in part on the bottom-up knowledge derived from the
individual word meanings. Ultimately, it will be necessary to model word
meanings, high-level structures, and the processes that bridge between
them. In the examples in this paper, we use only fairly simple rules of
combination to join sentence representations.

Another psychological requirement is that our representations should
be able to capture the way in which word meanings are learned. Indeed,
children's acquisition of the verbs of possession fits very nicely with the
predictions generated from the model, if one makes the Clark (1973)
assumption of gradual acquisition of semantic components (Gentner, 1973).

With these psychological issues in mind, we can then discuss the means
by which a text achieves the desired structure, and consider some of the
issues relevant to comprehension.

4. Semantic Structure and Comprehensibility

a. Semantic Function

A fundamental distinction is whether a word is used as a predicate
(e.g., a verb) or an argument (e.g., a concrete noun). There are two
opposing lines of argument as to which of these functions most affects text
understanding. Proper nouns and concrete nouns, the quintessential argument
terms, are relatively inert semantically. They function as pointers
to the world, but their semantic structures perform no connective function.
In contrast, the semantic structure of a verb serves to connect the noun
concepts into a unified proposition. By this line of reasoning, failure
to comprehend the verb should therefore lead to more confusion and more
disturbance in overall understanding of the text than failure to comprehend
a noun. In terms of the schematic diagrams shown in the figures, failure
to comprehend a noun means that the language user must store an empty or
incorrect node in one of the argument slots. Failure to comprehend the
verb could leave the language user with an unorganized heap of nouns that
cannot be connected with one another or integrated with the rest of the
text.

Preliminary evidence indicates that verb meanings are altered more
than noun meanings when subjects try to make sense out of bizarre sentences
like The lizard worshipped. Further investigation might include protocol
analyses of what happens when children encounter unknown or only partially
known words in various semantic roles. The results might indicate what
kinds of words in a text must be taught most carefully in order to avoid
serious confusion.

b. Case Structure

A clear distinction among verbs is how many arguments they take. Some
transitive verbs, such as sell, can take four noun arguments (the buyer,
the seller, the object, and the payment) in addition to the
time-of-occurrence. At the other extreme, stative verbs take only two noun
arguments (an object and a value) along with the duration (initial and
final times). One issue to investigate is whether a greater number of
many-place predicates in a text leads to more difficulty in comprehension.
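
A text-level measure of this variable might be the average number of noun
arguments per verb token. The sketch below assumes a small hand-built
table of case frames; only the frame for sell and the two-argument
stative pattern come from the text, and the remaining entry is a
hypothetical addition.

```python
# Hypothetical case frames (noun arguments only; times are left out).
CASE_FRAMES = {
    "sell": ("buyer", "seller", "object", "payment"),
    "give": ("giver", "recipient", "object"),   # assumed for illustration
    "own": ("object", "value"),                 # stative pattern
}


def mean_arity(verbs):
    """Average number of noun arguments over the verb tokens of a text."""
    return sum(len(CASE_FRAMES[verb]) for verb in verbs) / len(verbs)


def many_place_share(verbs, threshold=3):
    """Fraction of verb tokens taking at least `threshold` noun arguments."""
    many = sum(1 for verb in verbs if len(CASE_FRAMES[verb]) >= threshold)
    return many / len(verbs)


text_verbs = ["own", "give", "sell", "own"]
print(mean_arity(text_verbs), many_place_share(text_verbs))
```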


c. Semantic Complexity

Another relevant variable is semantic complexity: the number of
underlying components that make up the basic meaning of a word. More complex
meanings correspond to more specific references to objects or events. For
example, stride is more specific than go. Its meaning contains more
subpredicates. We know more having heard sentence (a) than sentence (b).

     (a) Ida strode across the field.
     (b) Ida went across the field.

Various researchers have proposed that semantic complexity may affect
comprehensibility, generally on the assumption that more complex semantic
structures are harder to process. However, studies performed on adults
have shown no evidence that more complex words lead either to longer
reaction times or to greater processing loads than do simpler words
(Kintsch, 1974).

There may be more relationship between complexity and difficulty in
children than has been found in adults. Research in child language has
shown that young children often fail to comprehend the full meanings of
complex terms (e.g., Bowerman, 1975; Clark, 1973; Gentner, 1975).
Working with the verbs of possession, we have observed that children
act out the simple verbs give and take correctly before they act out the
complex verbs sell and buy. Further, when young children (around 5 years)
are asked to act out sell (as in "Make Ernie sell Bert a boat."), they act
out give instead (a boat is transferred from Ernie to Bert); the same is
true for buy and take. Our interpretation, consistent with Clark's (1973)
semantic features analysis, is that children correctly act out the parts
of a word's meaning that they are conceptually able to comprehend. At any
given time, the child uses only the components that he has so far acquired
in comprehending language.
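
The interpretation above, that a child applies only the components
acquired so far, can be sketched as set intersection. The component
inventory is a hypothetical simplification chosen so that sell minus its
contractual components coincides with give, as the acting-out results
suggest; it is not the representation used in the studies cited.

```python
# Hypothetical component sets for the four possession verbs.
COMPONENTS = {
    "give": frozenset({"TRANSFER", "AGENT=SOURCE"}),
    "take": frozenset({"TRANSFER", "AGENT=GOAL"}),
    "sell": frozenset({"TRANSFER", "AGENT=SOURCE", "MONEY", "CONTRACT"}),
    "buy": frozenset({"TRANSFER", "AGENT=GOAL", "MONEY", "CONTRACT"}),
}


def understood_as(verb, acquired):
    """Verbs whose full meaning equals the acquired part of `verb`."""
    partial = COMPONENTS[verb] & acquired
    return sorted(v for v, comps in COMPONENTS.items() if comps == partial)


# Assumed knowledge states for the illustration.
young_child = {"TRANSFER", "AGENT=SOURCE", "AGENT=GOAL"}
older_child = young_child | {"MONEY", "CONTRACT"}

print(understood_as("sell", young_child))
print(understood_as("buy", young_child))
print(understood_as("sell", older_child))
```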

This being the case, the next question is what effect the use of buy
and sell rather than take and give has on children's comprehension of text.
The young child, having read that Ernie bought a car, may understand that
Ernie acquired possession of a car, but not that he did so by means of an
agreement with someone else by which Ernie was obligated to give some money
in return for the car. Thus, it seems inevitable that comprehension of the
immediate inferences intended in the sentence will be affected. We can
then ask how this local lack of complete comprehension of buy will affect
the overall comprehension of the story. Clearly two extreme cases need to
be distinguished: those in which the contractual notions involved in
buying are important to the sense of the whole text and those in which they
are not. In the former case, failure to comprehend those parts of the
meaning of buy will certainly have a pervasive effect on comprehension. In
the latter case, there is at least a chance that children can use those
parts of the meaning of buy that they understand and leave the rest aside
without its affecting their overall comprehension of the text.

One avenue of research that this suggests would be to examine
children's comprehension of text that includes either simple or complex word
meanings. These complexity variations should occur either in the noun
position or in the verb position. Table 1 outlines an experiment designed
to investigate these issues on several levels and with various measures.

d. Semantic Complexity and Semantic Connectivity

So far we have considered the possibility that an increase in
difficulty of comprehension might be caused by use of semantically complex
terms. On the other hand, the additional semantic components in a complex
verb may set up additional connections among the nouns in the sentence. In
this case, the more complex verb could lead to a richer and more highly
interwoven text representation, and thus to better memory for the text.
Gentner found this kind of improvement in connectivity in a series of
experiments in sentence memory (Gentner, 1978). Subjects were given
sentences that differed in the semantic complexity of their verbs, such as
the following pair of sentences:

     (simple)   Ida gave her tenants a clock.
     (complex)  Ida sold her neighbor some art posters.

They were better able to recall the recipient nouns (tenants and neighbors)
when a complex connective verb such as sell was used than when a simple
verb such as give was used. (Object-verb pairings were, of course,
counterbalanced.)

It is crucial for this effect that the additional semantic information
in the complex verb is such as to connect the nouns in question. This is
clearly true in the case of sell versus give, as can be seen in Figures 5a
and 5b.

Table 1

An experiment to test the effects of verb semantic structure on reading

Purpose: to see whether semantic complexity and connectivity have effects
on high-level reading performance.

Independent Variables           Levels

Semantic complexity             semantically complex words versus
                                semantically simple words

Semantic function               predicate (verb) versus argument
                                (simple noun)

Semantic connectivity           connective complexity in predicates
                                versus nonconnective complexity

Amount of high-level            strong story constraints versus
structure                       loose story constraints

Reader expertise                poor readers versus good readers at
                                same age

                                younger children versus older children

Dependent Variables

Ease of initial comprehension   time to read story
                                errors in reading out loud
                                accuracy in immediate questioning

Later memory                    ability to retell story
                                ability to recognize parts of the story

NOTES

Memory for material will be assessed both by verbatim accuracy and
by correctness of content.

Figure 5a. Ida gave her tenants a clock. General verb (few connecting
paths). [Network diagram not reproduced.]

Figure 5b. Ida sold her tenants a clock. Specific verb (many connecting
paths). [Network diagram not reproduced.]

Figure 5c. Ida mailed her tenants a clock. Specific verb (few connecting
paths). [Network diagram not reproduced.]

In some complex verbs, however, the additional semantic material
merely amplifies the simple verb in a manner that does not add connections
between the nouns. For example, the verb mail, whose representation is
shown in Figure 5c, adds the information that the method of transfer was
mailing or some similar form of long-distance transfer. It is this
information that makes the event description more specific than a general
act of giving. However, the knowledge that the object was mailed leads to
few, if any, additional connections between the agent noun (Ida) and the
recipient noun (tenants). Therefore, the prediction was that use of such
non-connecting specific verbs would lead to no improvement over use of
general verbs in memory between the nouns.

The results were exactly as predicted: The object nouns of connecting
specific verbs were recalled better than those of general verbs and
non-connecting specific verbs. Thus connectivity is beneficial to sentence
memory.

The experiment out I ined in Table I, in aod;*. v:,ii to testing whether 
complexity leads to difficulty, can be used here to measure the effects of 
connectivity. By varying not only the complexity of the verbs but also 
their connectivity, we can ask whether additional connectivity leads to 
Improvement in memory for a text. It may be that connectivity has other 
beneficial effects as well: that the vividness and aesthetic value of a 
passage Is In part due to the judicious use of highly connective predicates. 

We may expect these effects to be fairly age-sensitive, if only
because young children may lack understanding of some of the connective
components of word meaning. For example, at the age when the verb sell is
comprehended as having the same meaning as give, it seems likely that use
of sell leads to no improvement in text memory over use of give. Another
point that must be made is that the effects of connectivity arise from
specific representational structures. Which nouns will be held together
by a given verb must be predicted from the representational structure of
the sentence (for example, in Figure 5b note that sell creates many
connections between Ida and tenants, but adds hardly any extra connections
beyond that of give between Ida and clock). This line of prediction is
thus potentially much more precise than an approach that simply attempts to
divide verbs into connective and non-connective groups, or worse, into
complex and simple groups, without considering the semantic function of the
complexity.
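
The idea that connectivity must be read off the representational structure, rather than assigned to a verb as a whole, can be illustrated with a small sketch. The decompositions below are hypothetical stand-ins, not the representations shown in the figures: each verb is listed as a set of subpredicates, and each subpredicate names the noun roles it links. The count for a pair of roles is simply the number of subpredicates containing both.

```python
from collections import Counter
from itertools import combinations

# Hypothetical decompositions: each subpredicate lists the noun roles it links.
GIVE = [
    ("CAUSE", {"agent", "object"}),
    ("TRANSFER", {"agent", "object", "recipient"}),
]
VERBS = {
    "give": GIVE,
    # Non-connecting specific verb: the added component touches one role only.
    "mail": GIVE + [("METHOD_MAIL", {"object"})],
    # Connecting specific verb: added components link agent and recipient.
    "sell": GIVE + [
        ("TRANSFER_MONEY", {"agent", "recipient"}),
        ("CONTRACT", {"agent", "recipient"}),
    ],
}


def connections(verb):
    """Count, for each pair of roles, the subpredicates that contain both."""
    counts = Counter()
    for _name, roles in VERBS[verb]:
        for pair in combinations(sorted(roles), 2):
            counts[pair] += 1
    return counts


for verb in ("give", "mail", "sell"):
    c = connections(verb)
    print(verb, c[("agent", "recipient")], c[("agent", "object")])
```

Under these assumed decompositions, mail has the same counts as give, while sell raises only the agent-recipient count. This mirrors the pattern described for Ida, the tenants, and the clock.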

e. Semantic Integrability

Another factor likely to be important in text processing is the clarity
with which the sentence meanings fit together. If the semantic structures
set up by the verbs in the sentences are such that the connections between
them are transparent (for example, if the sentence representations share
common components), then the text should be smoother and easier to
understand.

We have only begun to investigate the role of semantic connectivity
in larger texts. It has been shown that semantic integration among
underlying subpredicates can be systematically created (Gentner, 1975). In
this study, a general verb, such as give, was presented in a passage that
contained additional semantic information, such as the fact that the giver
actually owed the money he was giving. The integration of the semantic
components of the context with those of the verb was hypothesized to
produce a more complex structure: in this case, the structure of pay. As
predicted, subjects hearing the extra material falsely recalled the verb
which best fit the composite structure, rather than the verb actually
presented.


5. Application to a Sample Text 

With the foregoing discussion in mind, let us re-examine the passage
from BABAR cited at the beginning of this section:

a. The Babar family is going to Paris.

b. Now the Babar family is on the train.

c. The train is coming into Paris.

d. Now they are off the train with all their bags.

e. The Babar family is waiting for a taxi.

f. The taxi takes them to their hotel.

g. Celeste and the children walk inside.

h. In their hotel room ...

Figures 6a through 6g show a sentence-by-sentence analysis of the
representational structures set up by the locational verbs in the passage.
(We consider only the locational information, although the discussion could
be extended to other inferences in the passage.)







Time information is not shown, since all information in the text
is given in the present tense. For simplicity of presentation, in
the remaining figures (6b-6g) the information that the initial location
is not the same as the final location will be abbreviated by a
FIGURE 6b. Now the Babar family is on the train.

FIGURE 6c. The train is coming into Paris.

FIGURE 6g. Celeste and the children walk inside.

Figures 7a and 7b show how these individual sentences can be combined
at the intersentence level. This, of course, is the crucial process that
allows readers to make sense out of passages of prose.
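
The kind of intersentence combination at issue can be sketched as bookkeeping over a containment relation. This is only an illustration under stated assumptions: the dictionary `inside`, the function `place_chain`, and the treatment of "the hotel is in Paris" as world knowledge are all hypothetical, and are not taken from the figures.

```python
def place_chain(entity, inside):
    """Follow 'X is in/on Y' links outward from an entity."""
    chain, seen = [], set()
    while entity in inside and entity not in seen:
        seen.add(entity)          # guard against accidental cycles
        entity = inside[entity]
        chain.append(entity)
    return chain


inside = {}

inside["family"] = "train"        # b. Now the Babar family is on the train.
inside["train"] = "Paris"         # c. The train is coming into Paris.
after_c = place_chain("family", inside)   # the family is, by implication, in Paris

inside["family"] = inside["train"]  # d. Now they are off the train ...
inside["hotel"] = "Paris"           # assumed world knowledge, not stated in the text
inside["family"] = "hotel"          # f. The taxi takes them to their hotel.
after_f = place_chain("family", inside)

print(after_c)
print(after_f)
```

Each sentence updates only one link. The implication that the family has reached Paris comes from chaining the links, which is the step the reader must supply.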
FIGURE 7a. Tracking the Babar Family. Sentences b and c give the
implication on the right. Note: Dotted lines in the left-hand figure
indicate intersentence connections.

FIGURE 7b. Tracking the Babar Family (continued). Sentences b and c
combined with sentences d, e, and f. At this point the implication can be
made that the Babar family is at their hotel in Paris, fulfilling the
travel plan

An example of incomprehensibility: The Indian Occupation. The Indian
Occupation is a story written to help poor readers obtain practice with
words that have certain patterns of spelling. It is a classic example of
the kind of confusion that results when a writer considers only the lower
levels (in this case, the orthographic level) and ignores the higher levels
of text structure. To give the reader some feeling for the problem, we
reproduce the entire passage:

The Indians had not heard from the government. The suit for Alcatraz
was still not settled. The Indians were discouraged and angry. They
did not know if their goal could be reached. Some people wanted to
tear down the buildings. "The white man is our foe," they said. "He
took our land 300 years ago. It's true! The white man wrote treaties,
but they were all a hoax." Other Indians said, "Wait! We must build
a place here that we can boast about. We must have a school. It's
dangerous for our children to roam through these old buildings. We
need food too. We must hoe the soil and plant tomatoes, potatoes, and
fruit." Suddenly someone roared, "Fire! Fire!" A fire had started in
an old building. Unfortunately, the boards made good fuel. The flames
soared high. There was no water to soak the buildings. The only water
on Alcatraz was the drinking water brought by the boats. The Indians
had no pumps to bring water out of the bay. Finally, the roaring fire
was reduced to coals and burned itself out. There were no clues to
tell how it started. Other problems came up. Food and water did not
come when they were due. Boats cruised by, but they didn't stop at the
island. Some of the Indians began to loaf. They forgot about their
oath to work together. Richard Oakes decided to leave the island.
Others said, "If he goes, we'll go too." Nobody could coax them to stay.
Now the Indians had other foes. Time was their enemy. They had to hoard
food and water. And there was no power or coal. The Indian people
needed warm coats. No one would loan them money. The school had to be
closed. Their nurse left the island. And the Indians still had not
heard if Alcatraz could be theirs. Some of the Indians forgot their true
goal. They argued. They fought over many issues. They roamed the
island wondering what to do. One day a boat cruised up to move them off
the island. Their dream had failed.

The passage seems very difficult; one finds oneself rereading
sentences and skipping backwards to find out whether one has missed prior
information. Why? It is not only because of references which come out
of nowhere, such as: Richard Oakes decided to leave the island. Another
reason for the difficulty may be the lack of overlap between the semantic
structures in adjacent sentences. For example, consider the passage that
begins

Other problems came up.

The next sentence is

Food and water did not come when they were due.

The verb come refers to a change of location of food and water to the
island. However, the next sentence

Boats cruised by, but they didn't stop at the island.

refers to a distinct change of location: the paths of boats past the island.
On second reading, the connection becomes clear: it would have been
desirable for the boats to have stopped at the island, because they might
have been carrying food and water. But none of this is transparent, and
none of it is accessible without considerable knowledge of the world. The
next sentence is

Some of the Indians began to loaf.

This has even less connection with the text than the previous sentences. A
canny reader, assuming that there must be a connection, can reason out that
the Indians are loafing because they are discouraged, because there are many
problems, and so on; but again there is no semantic overlap to help the
reader with these inferences, and so again comprehension of the story depends
on considerable reading expertise.
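
One rough way to make "overlap between the semantic structures in adjacent sentences" concrete is to list the components each sentence sets up and measure how many are shared. The sketch below is an illustration only: the component sets are hand-assigned and hypothetical, and the Jaccard ratio is a convenience chosen here, not a measure proposed in this report.

```python
def overlap(a, b):
    """Jaccard overlap between two sets of semantic components."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


# Hand-assigned, hypothetical component sets for three adjacent sentences.
sentences = [
    ("Food and water did not come when they were due.",
     {"food", "water", "come", "island"}),
    ("Boats cruised by, but they didn't stop at the island.",
     {"boats", "cruise", "stop", "island"}),
    ("Some of the Indians began to loaf.",
     {"indians", "loaf"}),
]


def adjacent_overlaps(sents):
    """Overlap of each sentence with the one that follows it."""
    return [overlap(x[1], y[1]) for x, y in zip(sents, sents[1:])]


scores = adjacent_overlaps(sentences)
print(scores)
```

With these assumed sets, the first pair shares only the island and the second pair shares nothing. Every further connection therefore has to come from inference and world knowledge.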

Concluding Remarks 
Semantic structure is only one level of text structure; for a story to
be comprehensible requires much more than mere sentence-to-sentence
connections. Indeed, text with good intersentence connections but poor
overall structure sounds glib and shallow, like an insincere political
speech. Yet the semantic structures of sentences are the building blocks
that make up higher-level structures, and from which higher-level
structures are derived. Semantic connectivity is clearly an important
factor in story comprehension. Further research will help to define its
role more precisely, to suggest the kinds of connectivity that are most
helpful, and to make clear the connections between sentence structures and
higher-level structures.


III. Syntactic Analysis 
A. Introduction

Syntactic analysis is taken to include the enumeration, display, and
discussion of syntactic properties of texts, or more strictly, of the
sentences constituting texts. This entails, at the very least, a parsing
(at least one level) of each of the sentences in a text and a
transformational history of each. The rationale for parsing is discussed
in section B, along with a description of the information provided by the
parsing. Section C describes the transformational history analysis and its
relevance to reading comprehension.

An important consideration of syntactic analysis is a measure of
syntactic complexity. The difficulties inherent in arriving at a
satisfactory measure of syntactic complexity are discussed in section D.

A further syntactic characteristic of texts that may affect
comprehension is the extent to which the diction employed in the text
differs from the conversational usage with which a child may be expected to
be familiar. Section E treats the analysis of differences between oral and
written language.

B. Parsing

Texts are parsed into major syntactic constituents, relationally
defined (e.g. in terms like subject, verb, direct object, and indirect
object, rather than purely syntactic terms like NP and VP). The following
information is included in the parsings:

a. Position of line breaks with respect to the major
constituents. This is of relevance to reading comprehension insofar
as line breaks might hinder the syntactic processing of
sentences they interrupt. One might speculate that they would
slow processing less when they come between major constituents,
such as between the subject and the predicate, and more when
they come between a determiner and its noun in the object of
a preposition. On the other hand, if it is easier to anticipate
what will follow a determiner than what will follow a
subject noun phrase, then line breaks that interrupt fine
details of structure might require the same amount of processing
time, or the breaks between major constituents might slow
processing more.

b. Indication of whether the sentence deviates from the
subject-initial order which is normal for English sentences. If
this "deviant" order is normal for that syntactic type of
sentence (e.g. in questions), this is also indicated. This
information is indicated on the assumption that such sentences
might affect reading comprehension by disrupting syntactic
processing. This is most likely to be true for the rather
small subset of such sentences which excludes questions,
quotation preposings, and there-insertion sentences, as well
as adverb preposings which are not accompanied by
subject-verb inversion, since such adverb preposings often
contribute to the connectedness of the text by putting in
sentence-initial position a connective which relates the sentence
following it to text which preceded it. (This may be seen
fairly clearly in the parsing of BABAR.) However, in practice
it turns out to be difficult in many cases to determine
whether a "deviant" order should count as "normal" for the
context or not. This is discussed in more detail in
section D (Syntactic Complexity of Texts).

c. Number of clauses or clause-remnants per sentence. This
is of relevance insofar as it may provide a measure of
syntactic complexity. See section D for further discussion.

d. Indication of whether the sentence being analyzed is a
direct quotation. This applies only to narrative texts. It is
relevant information in that one might want to be able
to segment the text into the characters' sentences and
those of the narrator, in order to analyze them separately.
Quoted speech may differ syntactically from connecting
narrative, and may require different kinds of inferences,
or more inferences, or both. It certainly should reflect
oral language more and written language less than the
non-quote portions. See Section E (Oral-Written Language
Differences) and Hermon (1979) for further discussion.

e. Comments describing 1) the exact position of line
breaks which interrupt major constituents, 2) the internal
structure of constituents interrupted by other major
constituents (only a few cases of this exist in the texts
examined), and 3) the discourse function of elements
contributing to "deviant" order, as described above. Such
elements are usually preposed adverbials, less often
postposed subjects.
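
The information listed in items a through e can be pictured as one record per sentence. The sketch below is a hypothetical layout, not the format used in the actual parsings: the class `ParsedSentence`, its field names, and the sample analysis of a BABAR sentence are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class ParsedSentence:
    number: int
    constituents: list                    # (relation, text) pairs, e.g. ("subject", ...)
    line_breaks_between: int = 0          # a. breaks falling between major constituents
    line_breaks_within: int = 0           # a. breaks interrupting a major constituent
    subject_initial: bool = True          # b. canonical subject-first order?
    order_normal_for_type: bool = True    # b. "deviant" order normal for this sentence type?
    clause_count: int = 1                 # c. clauses or clause-remnants
    direct_quotation: bool = False        # d. spoken by a character?
    comments: list = field(default_factory=list)  # e. free-form notes


def narrator_sentences(parsed):
    """Segment out the narrator's sentences so they can be analyzed separately."""
    return [p for p in parsed if not p.direct_quotation]


# Hypothetical analysis of one sentence from the sample passage.
example = ParsedSentence(
    number=2,
    constituents=[("adverb", "Now"), ("subject", "the Babar family"),
                  ("verb", "is"), ("locative", "on the train")],
    subject_initial=False,
    comments=["preposed adverb 'Now' connects the sentence to preceding text"],
)
print(len(narrator_sentences([example])))
```

Keeping the quotation flag on each record is what makes the separate analysis of characters' and narrator's sentences, mentioned in item d, a simple filter.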



C. Transformational History

The parsings are followed by a sentence-by-sentence analysis of the
transformations involved in the derivation of each of the sentences in
the texts. Such an analysis necessarily presupposes a particular theory
of transformational grammar. In compiling the lists of transformations
we have tried to include only transformations which the most generally
accepted versions of transformational grammar would include. That is, we
have listed only relatively uncontroversial transformations which affect
major syntactic constituents, and have avoided discussion of such
detailed and relatively uncharted areas as the derivation of possessive
constructions and pre-determiners.

This information is relevant to reading comprehension insofar as
transformations contribute to difficulty or ease of syntactic processing.
There are two ways in which they might do this. First, it could conceivably
be the case that certain transformations contribute to the generation of
structures which are unfamiliar to the reader, or which for other reasons
present processing difficulties. We have in mind here such transformations
as Relative Clause Extraposition, which moves a relative clause from the
noun phrase it modifies to the end of the clause that the noun phrase
occurs in. This particular rule is likely to present processing
difficulties for the young reader for both of the reasons mentioned above:
the resultant construction is one which the young reader may never have
heard, and it separates syntactic units which are very closely bound
semantically. Reuniting them may be a relatively difficult task for a
young child. If we had firm knowledge of which transformations were
responsible for structures which would be likely to cause processing
difficulties, then an index of the transformations involved in the
generation of a text could provide the data base for computing a measure
of difficulty for the text. Such information could, it would seem, be
obtainable from a carefully designed series of experiments.

D. Syntactic Complexity of Texts

Several methods of calculating syntactic complexity were employed in
the first analysis (BABAR and DESERT). None was satisfactory. The first
were measures which refer to the number of clauses or verb-containing clause
remnants per sentence, with sentence defined as material beginning with a
capital letter and ending with a full stop (period, question mark, or
exclamation point). Clause, as used here, refers not only to independent
clauses, subordinate complement clauses, and relative clauses, but also to
any clause remnant which contains a finite, infinitive, or participial verb
form whose objects have the same form and position relative to the verb
they would have if their verb were a main verb. Thus, in BABAR there are
sentences which have no clauses, since there are fragments consisting
simply of NPs which are punctuated like sentences, e.g. 115 and 155:

115. Poor Babar!

155. The Mustache-man!

Verb-containing remnants include infinitive clauses/phrases which function
as objects, adverbials, noun modifiers, etc., and participial modifiers and
complements. Nominalizations and preposed adjectives are not included, on
the grounds that the object in such constructions does not have the same
form or position that it would have if its verb were a finite form.

Two statistics were calculated: range of verbs (defined as above) per
sentence and average number of verbs per sentence. On the first measure,
BABAR and DESERT are nearly equally complex, since the range in BABAR is
0-3 and the range in DESERT is 1-3. On the second measure, DESERT is
considerably more complex, since the average number of verbs/sentence is
1.48, whereas in BABAR it is only slightly more than 1 (1.05, to be
precise). This corresponds to our intuitions about the complexity of these
two texts. By comparison, the Reader's Digest article from which DESERT was
written, a piece called "The Challenge of the Desert" (CHALLENGE), had a
range of 1-6 verbs per sentence, and an average of 2.30.
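
Both statistics are simple to compute once each sentence has been assigned a verb count under the definition above. The following is a minimal sketch; the function name `verb_stats` and the sample counts are made up for illustration and are not drawn from any of the texts analyzed.

```python
def verb_stats(verbs_per_sentence):
    """Return the (min, max) range and the mean number of verbs per sentence."""
    low, high = min(verbs_per_sentence), max(verbs_per_sentence)
    mean = sum(verbs_per_sentence) / len(verbs_per_sentence)
    return (low, high), mean


# Made-up counts for a six-sentence text; a 0 is a verbless fragment.
counts = [0, 1, 1, 2, 1, 3]
value_range, average = verb_stats(counts)
print(value_range, round(average, 2))
```

Note that the range is fixed by the two most extreme sentences alone, which may be part of why it separates texts so poorly.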

The measure of complexity provided by these figures is not fine enough
to be of much use. While CHALLENGE is intuitively quite complex
syntactically, probably well above the abilities of the average
second-grader, books are written for young children which do not seem
particularly syntactically complex, but which have similar complexity
indices if complexity is measured this way. For example, Charles M.
Schulz's He's Your Dog, Charlie Brown (1974), hereafter termed DOG, has a
range of 0-6 verbs per sentence, and an average of 2.44 verbs per sentence.
The average number of verbs per sentence is even higher than CHALLENGE,
although intuitively DOG seems much easier.

The number of verbs per independent clause would seem to offer a
promise of being a more useful measure of complexity, since it would
not treat the number of independent clauses conjoined in a sentence as
contributing to complexity, which verbs/sentence does. However, a
comparison of the four texts for which the calculation was made indicates
that verbs/clause is no more useful than verbs/sentence, in that both
make DOG appear even more syntactically complex than CHALLENGE.

             Verbs/sentence   Verbs/clause

BABAR             1.25            1.31

DESERT            1.48            1.45

DOG               2.44            1.75

CHALLENGE         2.30            1.61

Clauses per T-unit and T-units per sentence (Hunt, 1965) were
also calculated to see if they would provide an index of complexity
that corresponded to our pretheoretical judgments of the relative
complexity of these four texts. But they turned out to be no better than
any of the other measures. In fact, as indicated in the chart below,
where the books are ranked in order of increasing complexity according
to our judgments, the figures for T-units/sentence indicate that
CHALLENGE is considerably less complex than even BABAR and DESERT, a
claim we cannot accept as accurate. (T-units/sentence is supposed to be
inversely proportional to complexity, and is supposed to reflect the
achievement of multi-clause sentences by coordination rather than
subordination.) Clearly this ratio can only be sensibly interpreted in
conjunction with some measure of sentence length in terms of words or
clauses. But as it turns out, only measures of sentence length in
terms of words per T-unit or words per sentence provide indices of
complexity which rank the four texts in the same order as we did. Of
these, words/sentence appears to be truer to our judgments of the
relative distance between the texts, contradicting Hunt's claim of
superiority for words/T-unit as a more accurate measure of maturity of
style than words/sentence. Needless to say, such measures give no
indication of the source of the complexity that they indicate.

             Clauses/T-unit   T-units/sentence   wds/T-unit   wds/sentence

BABAR             1.19              1.03             6.97          6.58

DESERT            1.13              1.02             8.49          8.56

DOG               1.32              1.30            11.15         12.91

CHALLENGE         1.30              1.22            13.50         16.43


(Interestingly, the clauses/T-unit ratio for CHALLENGE is identical
to that found by Hunt in writing samples from fourth graders. The
twelfth graders whose writing he examined averaged 1.68 clauses/T-unit.)
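
The four ratios in the chart all derive from the same four counts for a text. The sketch below shows the arithmetic only; the class `TextCounts`, the function `hunt_ratios`, and the sample counts are hypothetical and do not reproduce the counts behind the figures reported above.

```python
from dataclasses import dataclass


@dataclass
class TextCounts:
    words: int
    sentences: int
    t_units: int
    clauses: int


def hunt_ratios(c):
    """Compute the four length and subordination ratios from raw counts."""
    return {
        "clauses/T-unit": c.clauses / c.t_units,
        "T-units/sentence": c.t_units / c.sentences,
        "words/T-unit": c.words / c.t_units,
        "words/sentence": c.words / c.sentences,
    }


# Made-up counts for illustration.
sample = TextCounts(words=120, sentences=10, t_units=12, clauses=15)
ratios = hunt_ratios(sample)
for name, value in ratios.items():
    print(f"{name:18s}{value:6.2f}")
```

Because all four ratios are computed from one set of counts, none of them records which constructions produced the length. This is the limitation noted at the end of the paragraph above the chart.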

Among the syntactic constructions contributing, in our opinion,
to the complexity of CHALLENGE which are overlooked by these measures
are:

1) Sentences with adverbial phrases as subject: From dawn to dusk in
the sun can finish him.

2) Sentences where the subject is inverted after a preposed locative
adverb: Yet in every desert are living things -- plants, animals, human
beings -- which defy the dictionary and the desert itself. Today around
his villa are green lawns with fountains playing.

3) Sentences with dangling pre- and post-modifiers. (We do not claim
that these are ungrammatical, or even poor style, only that their correct
interpretation is not transparent.) Some examples: Following the Nile's
course for 560 miles, from near Luxor to the Delta, it is six miles wide.
A species of frog in the Australian desert stores water in its abdominal
cavity, bloating itself until it is spherical in shape. The world's
largest, the Sahara, is advancing on a 2000-mile front, in some places as
much as 30 miles a year. This fringe is invaded by nomadic herdsmen,
forced out of the desert.

4) Sentences where the use of semicolons indicates a connection between
ideas, but does not make that connection explicit, as in these passages:
The cactus has an efficient water-storage system in its thick stem; one
species of tree cactus may contain hundreds of gallons. Egypt once was a
semi-desert; then in the seventh century the invading Arabs brought in
great herds of camels, sheep and goats. In the first passage, the second
clause is an exemplification of the claim of efficiency made in the first
clause. But it is not clear even to us what the connection is supposed to
be in the second passage, and there can be little doubt that the syntactic
juxtaposition of clauses without explanation (or even a hint of the
motivation for juxtaposition) causes difficulty.

Another possible measure of complexity refers to the proportion of
sentences which deviate from the canonical subject-first order
characteristic of English. Deviant sentences, defined this way, include
questions and sentences containing quotations and preposed adverbs,
regardless of whether the subject is inverted to follow the verb or not.
Thirty-one percent of the sentences in BABAR and thirty-seven percent of
the sentences in DESERT have non-canonical word order. But percent of
sentences with non-canonical word order is a very rough measure since many
such sentences are syntactically unmarked in the sense of psychological
marking. For example, questions are pragmatically unmarked (in the same
sense), in that the non-deviant order of constituents would make the text
bizarre or incoherent.
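
How much this percentage depends on what is allowed to count as deviant can be seen in a small sketch. The sentence labels, the function `percent_noncanonical`, and the five-sentence sample are hypothetical; they are not a tabulation of BABAR or DESERT.

```python
def percent_noncanonical(sentences, treat_as_normal=frozenset()):
    """Percent of sentences that are not subject-initial, optionally
    excluding sentence kinds whose non-canonical order is taken as normal."""
    deviant = [
        s for s in sentences
        if not s["subject_initial"] and s["kind"] not in treat_as_normal
    ]
    return 100 * len(deviant) / len(sentences)


# Made-up sample of five parsed sentences.
sample = [
    {"kind": "declarative", "subject_initial": True},
    {"kind": "declarative", "subject_initial": True},
    {"kind": "question", "subject_initial": False},
    {"kind": "quotation", "subject_initial": False},
    {"kind": "adverb_preposing", "subject_initial": False},
]

broad = percent_noncanonical(sample)
narrow = percent_noncanonical(sample, {"question", "quotation"})
print(broad, narrow)
```

The same sample yields quite different figures under the two definitions, so a reported percentage is only interpretable alongside the criterion used.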

Defining "deviant" is the major problem with this measure of
complexity. The strictest criterion for "deviant" order -- that an
apparently deviant order of constituents in a sentence should be considered
natural only if the sentence would be ungrammatical with any other order --
is too strict to be useful. In addition to labelling as deviant inverted
quotation-containing sentences like "Don't worry," says Celeste, it does
not allow one to consider questions with the order Object-Verb-Subject,
such as sentence 20 in BABAR:

20. What is this?

to be natural, given the existence of nonsynonymous grammatical
echo-questions with the order Adverb-Verb-Subject like Where is he?
Furthermore, it would categorize sentences which begin with a connective or
deictic adverb like so, then, later, etc. as deviant if the adverb would be
grammatical in some later position, despite the fact that in such a
position it would detract from the connectedness of the text.

One might suppose that the number of transformations in the
derivational history of a sentence would provide an index of its syntactic
complexity, but this is unlikely to be true, for the simple reason that
while some transformations, such as Passive, Gapping, and Niching, may very
well contribute to syntactic complexity and difficulty of processing,
others, like Pronominalization, Equi-NP-deletion, and WH-movement, most
likely do not have this effect. To utilize the transformational history of
a sentence to provide an index of its complexity, one would need fairly
certain knowledge about which transformations contribute to syntactic
complexity and which ones reduce it or are neutral. We do not have such
knowledge at this time; it may turn out that transformations do not have
any absolute complexity coefficient. It may be, instead, that whether a
certain transformation contributes to the complexity of a certain sentence
will depend on the discourse function that sentence is intended to serve
(cf. Green, 1978). This is a burgeoning field of research, but there are
no firm results as yet.
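
The difference between a raw count of transformations and an index that depends on per-transformation knowledge can be made concrete. In the sketch below the weights are placeholders invented purely for illustration; as stated above, no such coefficients are actually known, and they may not exist in any context-free form.

```python
# Placeholder weights, invented for illustration only: 1.0 for transformations
# suspected of adding difficulty, 0.0 for those suspected of being neutral.
HYPOTHETICAL_WEIGHTS = {
    "Passive": 1.0,
    "Gapping": 1.0,
    "Niching": 1.0,
    "Pronominalization": 0.0,
    "Equi-NP-deletion": 0.0,
    "WH-movement": 0.0,
}


def raw_count(history):
    """Number of transformations in a sentence's derivational history."""
    return len(history)


def weighted_index(history, weights=HYPOTHETICAL_WEIGHTS):
    """Sum of assumed weights; raises KeyError for an unweighted transformation."""
    return sum(weights[t] for t in history)


sentence_a = ["Passive", "Gapping"]
sentence_b = ["Pronominalization", "Equi-NP-deletion", "WH-movement"]

print(raw_count(sentence_a), weighted_index(sentence_a))
print(raw_count(sentence_b), weighted_index(sentence_b))
```

Under these assumed weights the two measures rank the sentences in opposite orders. A weighting that varied with discourse function would need the sentence's context as a further input.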

E. Oral-Written Language Differences

This section treats the analysis of differences between the language
of written texts and the language of speech. Such differences may affect
reading comprehension in at least two distinct ways. First, during the
period before decoding proficiency is complete, texts in a specifically
written-language style may present what amounts to the added burden of
learning a foreign dialect at the same time the learners are supposed to be
mastering the relation between the language they are already familiar with
and classes of marks on paper. The language of written texts may be assumed
to be unfamiliar and foreign to children to the extent that they were not
read to regularly as preschoolers. Children who come across unfamiliar
words or constructions before decoding is mastered may not know whether to
doubt the familiarity of the form, or their own mastery of the
sound-grapheme correspondences they are supposed to be learning. We
speculate that children who have been read to extensively as preschoolers,
and are at least passively familiar with the language of written texts,
will not be much bothered by the differences between oral and written
language, but that instances of specifically written language in texts may
provide a source of difficulty for children who are relatively unfamiliar
with books and written stories.

Second, after decoding proficiency is attained, too much oral language
in a text or oral language in unexpected places may slow comprehension or
reduce it in two ways. In confounding the reader's expectations of
exclusively written language, there may simply be a jarring effect of
interrupting the cognitive processes which contribute to comprehension. The
result of this, we assume, would show up largely as slower comprehension
rates. But a child might also be distracted in a more serious way: the
child may begin to look for an implicature of this inappropriate style and
become sidetracked, wondering why the text was written as if it was someone
talking.

Carefully designed experiments with readers of various proficiency
levels and backgrounds should be performed to test these hypotheses, since
they have clear and relatively easily implemented implications for the
writing and editing of texts for different types of readers. Among the
things that would have to be controlled in such experiments are the native
dialect of the reader and the reader's familiarity with the language of
books.

IV. Analysis of Text-Level Properties

This section contains discussion of the analysis of text-level
properties of texts which may affect reading comprehension. Section A
treats the analysis of the organizational structure of texts, and contains
comments on how this structure is indicated overtly to the reader, and how
well it is indicated. Section B is devoted to discussion of a number of
text-level discourse properties. These include information structures,
rhetorical structures, and connective devices, including anaphoric
(pronominal) devices.

A. Text Structure

1. Introduction

The structural analysis of texts can be carried out at a number of
levels, depending on the length and type of text. A richly structured
text such as BABAR could possibly bear analysis at as many as four or five
levels of organization. In the absence of a general and empirically
vulnerable theory of the structure of texts (cf. Morgan & Sellner, in press,
for discussion), it has been necessary to turn to pre-theoretical analyses
and analyses restricted to certain literary genres, like the Plans analysis
described in Section IV.A.2, and the Story Grammar, discussed in Section
IV.A.3, which are both limited to analysis of narrative texts, the
latter to narrative fiction. Absence of theory may be less of a problem
for expository texts, since the logic of the argument should largely
determine the structure of the text, and a well written text will make this
structure explicit, both at the sentence level and at higher levels of
structure. Where a large number of inferences are required to connect
individual sentences and paragraphs with each other, and where there is an
absence not only of explicit connectives, but of syntactic or lexical clues
with which intended conceptual relations between parts could be inferred,
we would predict that comprehension will be comparatively very poor.

2. Interacting Plans Analysis 
a. Introduction 

If we want to know why some children have difficulties learning to
read, we must develop a better idea of what they are reading, or could be
reading. Analysis of texts alone will not tell us what is the best way to
help someone learn to read, but it makes asking better questions possible.
This section, together with the reports "What makes a good story?" (Bruce,
1978) and "Interacting Plans" (Bruce & Newman, 1978), constitutes one part
of the larger text analysis effort.

Understanding the plans and beliefs of characters in a narrative is
clearly an important aspect of reading comprehension. The study of such
plans is part of text analysis because it depends on an exacting study of
the way actions are described in the text. It is also an analysis of what
goes on "between the lines" of a text, since it considers motivations and
reasonings that may not be explicitly stated.

A plans analysis is particularly appropriate for two of the central
questions of text analysis: "What is it that makes a given story easy or
hard to comprehend?" and "What is it that makes a given story good or bad?"
The answers to these questions surely interact, but it is not true that
difficulty alone determines the quality of a text, nor even its
appropriateness for a child whose reading skills have developed to a known
level. The concluding section deals with this issue further.

b. An Interacting Plans Analysis 

An important aspect of a narrative text is that it relates actions
connected through goals, effects, and enabling conditions. The statement,
"We understand actions in terms of goals," has become a truism; actions
simply are the way goals are attained. This is true for the realms of
conversation, stories, or human activity in general, and there has been
extensive work to show just how goals and actions relate. But an important
implication of goal-based understanding of actions is often overlooked. If
we can interpret an action in terms of goals, then so can others who may be
affected by that action. They may then act, not just in terms of their
goals, but in terms of their understandings of the actor's goals. This
means that when two or more people interact, their plans can reach a level
of complexity that is difficult to foresee from consideration of
single-actor plans.

The distinction between simple plans and interacting plans can be seen
in the first part of the Grimms' fairy tale, "Hansel and Gretel" (Appendix
I). Hansel and Gretel are the young children of a woodcutter. Their
stepmother convinces the woodcutter that they have too little food for both
the parents and the children. Her solution to this problem is to take the
children into the woods and abandon them. The execution of her plan starts
the first episode of the story, an episode that is analyzed in detail.

A simple analysis of "Hansel and Gretel" would show that Hansel has a
goal to be able to return home after being taken into the woods. To reach
that goal Hansel drops pebbles along the trail so that the children can
retrace their steps. We could understand what Hansel does in terms of a plan
in which dropping pebbles is an action appropriate to the goal. The plan
would show how the actions of dropping pebbles and following the trail fit
together, and how they produce desired outcomes for Hansel and Gretel.

But such a plan would be incomplete. Hansel and Gretel are being taken
into the woods deliberately by their parents. Hansel knows that he should
drop pebbles because he and Gretel have overheard their parents plotting
against them. The children's plan is a response to their conception of
their parents' plan. Hansel and Gretel are not just "returning home" but
are "countering" the plan they perceive their parents to have. It would
have been of little use for Hansel to drop pebbles on a familiar trail, and
if his parents were planning to kill the children outright, some other
response would have been more effective. His action becomes meaningful only
with respect to his perception of the structure of his parents' plan.

What we find upon further analysis is that each of the characters in
the fairy tale is acting in a reality determined by his or her perception
of the others' plans. They continually evaluate what the others are doing
and react accordingly. Such behavior, characterized by interacting plans,
is fundamentally different from that found for one-person plans. The
discussion to follow develops this idea further. It relies on ideas developed
in work on planning algorithms (e.g., Tate, 1975; Sacerdoti, 1973, 1975;
Sussman, 1975), on plan recognition (e.g., Schmidt & Sridharan, 1976), and
on the use of plans (e.g., Cohen & Perrault, 1976; Bruce, 1979), but the
focus is on the interactions among plans. The problem is not just to show
how actions can be organized into plans. We need to do that, but we also
need to consider concepts such as cooperation and conflict, actions defined
in terms of plans, intentionality, degree of interaction, levels of
conceptualization, and beliefs about plans. The next section presents a
simple example of an interacting plan, taken from the analysis done in Bruce
and Newman (1978).

c. A Simple Interacting Plan: A "Request" 

When they reached the middle of the forest, their father
said, "Now, children, pick up some wood. I want to make
a fire to warm you."

Hansel and Gretel gathered the twigs together and soon
made a huge pile. Then the pile was lighted, and when it
blazed up the woman said, "Now lie down by the fire and
rest yourselves while we go and cut wood. When we have
finished we will come back to fetch you."

The parents' plan is an interacting plan, since it is a plan to achieve
goals in interaction with the children. The idea of an interacting plan can
be illustrated with a simple example (see above) taken from their overall
plan. Figures 8 and 9 represent the parents' plan to build a fire for the
children once they are out in the forest. Figure 8 represents the parents'
plan to keep the children warm while they are waiting for the parents to
finish cutting wood. With this example we can illustrate some of the basic
nodes and relations used in the representation system. Then, using Figure 8
in combination with Figure 9, we can illustrate a simple example of
interacting plans.

One action ("Parents light pile of twigs") is shown in Figure 8 in the
square node. Fire burning is a simple state which satisfies the intentional
state (labelled "P. A") which is the mental state leading directly to the act.
The IntendAchieve state is specified by an IntendMaintain state, which in
this case is the more general intention to keep the children warm. Since
the parents know that a burning fire will Produce warm children, they know
that the general goal of keeping the children warm can be accomplished in
this case by causing a fire to be burning.




Figure 8

The parents' plan to keep the children warm

The link to Figure 9 is the state "Twigs in pile." This is a necessary
condition for the pile of twigs to be lit, so it is linked to the act
of lighting by an Enables relation. Whenever an act has an enabling condition
that is not met, an intention to achieve that state is generated. In Figure
9 that intention is represented as an intention by the parents to achieve
the state of a pile of twigs existing. In another situation this goal might
be achieved by going about gathering twigs. But here, the parents choose to
get the children to perform the necessary actions. Thus, we have an elementary
interacting plan. The intention to achieve a pile of twigs is changed into
an intention to achieve an intentional state in the children. This new goal
is achieved by means of saying to the children, "Now, children, pick up some
wood. I want to make a fire to warm you." This, of course, is a request, and
it has the effect of the children having the intention to achieve a pile of
twigs by means of gathering twigs. This action satisfies the parents'
intention to have a pile of twigs and satisfies the enabling condition for
their building a fire. Notice that while the parents' intention to have twigs
in a pile is present at time a, the children's intention comes into existence
at time b, only after the parents' request.

Figure 9

A simple interacting plan: Requesting help from the children

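The request plan just described can be written down as a few small data structures. The sketch below is illustrative only: the class names `State`, `Intend`, and `Act` and the helper functions are hypothetical and are not the notation of Bruce and Newman (1978). It assumes just two rules stated in the text: an unmet enabling condition generates an intention to achieve it, and a request turns that intention into an intention about someone else's intention.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class State:
    """A simple state of the world, e.g. 'twigs in pile'."""
    description: str


@dataclass(frozen=True)
class Intend:
    """An intentional state: `holder` intends to achieve `goal`."""
    holder: str
    goal: Union[State, "Intend"]


@dataclass(frozen=True)
class Act:
    """An act together with the states that enable it (the Enables relation)."""
    actor: str
    description: str
    enabled_by: tuple


def intentions_for_unmet_conditions(act, true_states):
    """Generate an intention for every enabling condition that is not met."""
    return [Intend(act.actor, s) for s in act.enabled_by if s not in true_states]


def delegate(intention, helper):
    """Turn 'holder intends G' into 'holder intends that helper intends G'."""
    return Intend(intention.holder, Intend(helper, intention.goal))


twigs_in_pile = State("twigs in pile")
light_pile = Act("parents", "light pile of twigs", (twigs_in_pile,))

# Time a: no pile exists yet, so the parents acquire an intention to achieve one.
parents_goal = intentions_for_unmet_conditions(light_pile, true_states=set())[0]

# The request converts that goal into a goal about the children's intentions.
request_goal = delegate(parents_goal, "children")
print(request_goal)
```

The nesting of one `Intend` inside another is the point of the sketch: it is what distinguishes this elementary interacting plan from a single-actor plan in which the parents would simply gather the twigs themselves.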
d. Virtual Plans

One of the most striking things that emerges upon even a cursory
examination of simple stories or conversations is that people frequently act
in the context of complex plans. Even more striking is that they often act in
response to or with respect to plans that they perceive being carried out, or
that they intend for someone else to carry out, that is, plans which have the
structure of actual plans but are not in fact used as determinants of behavior.
For example, in one segment of natural conversation (collected by William Hall),
an analysis in terms of plans shows that a mother in a mother-child
conversation is acting in response to her perception of the child's plan and
the plan she wishes the child to have. Either or both of these plans may
differ from the plan the child actually has.


The result is a much more complex structure in which plans themselves
become units. Consider, for example, the story of "The Dog and the Cock"
(Appendix II). In it, a fox tries to entice a rooster to come down from a tree
by flattering him and inviting him to "come to breakfast." The rooster accepts
the invitation with the proviso that he may bring along his friend, who happens
to be a big dog. The fox greedily agrees but soon finds that the dog is not
another rooster (= breakfast) but a foe who bites him on the nose. Clearly
there is a plan of the fox to eat the rooster. At the same time, we have the
rooster's plan to get rid of the fox. These plans interact with each other in
terms of a third plan which neither of the characters expects to carry out nor
intends to have carried out.

The virtual plan is that the rooster comes to the fox's house as a guest
for breakfast. The rooster, we may assume (though many children in fact do
not assume this), does not intend to carry out this virtual plan. Nor does
the fox expect the rooster to carry it out. Nevertheless, both characters
talk about it as if it were real. The fox, for instance, says, "Will you
come to my house for breakfast?" thus pointing to the virtual plan. The
rooster accepts the virtual plan, or at least appears to do so, when he says,
"Yes, thank you, I will come," and then he suggests a modification when he
says, "If my friend may come too." The fox accepts the modification, saying,
"Oh yes, I will ask your friend. Where is he?" Thus we have a situation
in which the characters discuss and modify a plan which neither expects
to be carried out.

Virtual plans are common in stories. Hansel and Gretel's parents use
the virtual plan of ordinary wood fetching to pursue their real plan of
getting rid of the kids. Meanwhile, Hansel and Gretel use a similar virtual
plan to achieve their real goal of returning home. In fact, the following
outline appears to be a good model for a large class of children's stories.
It applies, for instance, to "Hansel and Gretel" and "The Dog and the Cock."
It defines a kind of deception wherein characters act on the basis of real
plans, but pretend to act on the basis of virtual plans.

Outline of a Typical Story Containing
a Virtual Plan

1. A has a problem which suggests a goal that is in conflict with
a goal of B.

2. A realizes that B's normal actions (or inactions), i.e., B's real
plan, will not help him to achieve his goal.

3. A further realizes that B will not alter B's plan to suit A's goals.

4. A therefore puts forth a virtual plan either to conceal A's real plan
or to entice B into doing something B would not otherwise have done.

5. B responds to the virtual plan. In some cases B falls for the trap,
e.g., in Aesop's fable of "The Fox and the Crow" the crow sings in
response to flattery and drops a piece of meat. In other cases, B
sees through the virtual plan to A's real plan, then pretends to go
along with the virtual plan, or puts forth B's own virtual plan.

6. Actions proceed, but each action has alternate simultaneous
interpretations, as part of the virtual plans and as part of the real plans.

7. At some point the virtual plan is discovered, or uncovered, and the
story (or episode) draws to a close.



e. Critical Beliefs

One of the most interesting results of our analyses in terms of
interacting plans has been the realization of the tightly interwoven character
of the plans representations. Any representation of "meaning" can be
viewed as an arbitrary and unsatisfying abstraction from the "whole," but
plans seem even less divisible than other facets of meaning. It is much
more difficult, for example, to change one small part of a plan's
representation without producing rippling effects throughout the representation.

A consequence of this holistic property of plans is that a single
belief can assume tremendous importance. In "The Dog and the Cock," the
reader's belief that the Cock believes that foxes like to eat cocks appears
to be a critical belief for the building of the typical adult
interpretation of the story. Some children do not seem to have this belief and
build a different interpretation in which the Cock is an unwitting potential
victim of the Fox, who is saved through no effort of his own by the Dog.

The latter interpretation is internally consistent, and it matches
the story as well as the typical adult interpretation. Is it therefore
also correct? How many different interpretations are there? We may not
be able to answer those questions, but we can observe that the one critical
belief has had significant ramifications for the interpretation. Consider
how readers with the two interpretations would answer the following
questions:

1. Did the Cock trick the Fox?

2. Did the Fox trick the Cock?

3. Was the Cock smart?

4. Did the Cock think that the Fox liked his singing?

5. Was the Cock happy with what the Dog did?

The notion of critical beliefs seems worth pursuing. It may account
for some differences in interpretations due to cultural variation found
among readers. It also needs to be considered when we think of testing for
comprehension. Finally, it shows one more way in which the things the
reader brings to the text are as crucial to understanding as what is "in"
the text.

f. Complexities -- Easy vs. Hard Texts

A formal analysis demonstrates that even apparently simple stories may
require complex plans representations. A reader needs to be able to induce
plans from the often sketchy statements of actions and intentions. He or
she must then be able to use the induced plans to connect events. There
are a number of specific abilities a reader would need in order to
understand plans in this way. We do not know that these abilities are a major
cause of reading comprehension difficulties, or even that they form a
complete or well-defined set of skills with respect to understanding plans.
Rather, they point to areas that might be worth investigating.

Among the complexities are the following (see Bruce & Newman, 1978, 
for a more detailed discussion): 

1) Changes in plans. Plans in a story can remain fairly constant,
like Babar's plan to retrieve his crown, or may change in response to events.
The number and magnitude of changes may be a source of difficulty.

2) Size of plan. Plans vary in their inherent complexity. A plan
may involve a long sequence of acts or may be accomplished by a single act.

3) Embeddings of beliefs. Whenever a belief is about another person's
beliefs, one must be able to shift point of view. Sometimes a story, e.g.,
"Hansel and Gretel," requires multiple shifts of point of view, to beliefs
about beliefs, etc.

4) Embeddings of intention. Similarly, intentions may be embedded.
For example, Hansel and Gretel's parents intend the children to have the
intention of following them into the woods.

5) Embeddings of plans. A consequence of the embeddings of beliefs
and intentions is that one's plan can be defined with reference to other
plans, and those plans to yet other plans. Hansel's stepmother, for
instance, tries to block Hansel's attempt to block her plan to abandon
Hansel and Gretel.

6) Degrees of interaction. When there are multiple actors in a story,
their plans can be more or less interconnected. Hansel and Gretel's plans
are intertwined with their parents' plans. Each is trying to respond to
the others and to get the others to do an act in a particular way. In
other stories there may be only one character, or characters' plans may not
interact as tightly.

7) Deception. A story that involves deception, e.g., "Hansel and
Gretel," is inherently more complex than one that does not.

8) Conflicts. The number and types of conflicts among plans in a
story may also be a source of complexity. In a general sense, we can view
actions as attempts to reduce conflicts among plans. For example, there is
a potential conflict between a plan I believe you to have and the plan I
want you to have. There is also a potential conflict between a plan I
want you to believe I have, and the plan I believe you believe I have. It
is not necessarily the case, however, that plans of any type can conflict
with plans of any other type. In fact, the identification of types of
plans leads us to an identification of types of conflicts that can arise
among plans in interactive situations. This suggests a number of questions
about people's recognition of and response to such conflict situations.
For example:

a. To what extent do people of different ages recognize embedded
plans?

b. How deep do these embeddings go?

c. Do the potential conflicts actually arise in all situations?

d. How sensitive are people to the ability of those they are
interacting with to perceive such embeddings and conflicts?

e. How are the virtual plans and the conflicts among them signaled
in text?

f. What are the consequences of one's understanding of different
levels of interpretation of virtual plans?

9) Maintaining different points of view. Having to maintain different
points of view, e.g., that one character believes X where another believes
not-X, imposes demands on the reader. In addition to the levels of
embedding mentioned above, there may be problems associated with maintaining
a large number of differing beliefs or maintaining any differences for an
extended period.

10) Cultural presuppositions. Stories that involve beliefs about
character types or simply facts about the physical world may place
differential demands on readers depending on their experiences prior to
reading.

11) Beliefs outside of "shared belief space." In a normal episode
most beliefs are "shared" among the participants, meaning not only that
they both believe, but that they believe that the other believes, and that
the other believes that they believe. The reader can then assume that all
knowledge is transparent to all. Often, though, one must assume that there
are beliefs outside of the shared space, not necessarily conflicting
beliefs, but beliefs that are not known to one or more characters.

12) Inference. The number of extra beliefs needed and the amount of
deduction required to link together actions in a story is also an
indication of possible difficulties for the reader.

13) Explicitness of plans. Texts vary in the degree to which they are
explicit about the plans and intentions of characters in the story. Stories
are more difficult when the reader has to infer plan structures from the
simple statements of actions.

14) Act hierarchies. An important aspect of interacting plans is that
people develop them and carry them out in the context of their perceptions
of others' actions. The same action can be viewed at various levels or
clumped together with other actions. There can be many levels of
conceptualization for the same act or sequences of acts.
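Several of the complexities listed above (embeddings of beliefs, of intentions, and of plans) come down to how deeply one mental state is nested inside another. A minimal sketch of that count follows. It assumes a hypothetical nested-tuple encoding `(holder, attitude, content)` that is not part of the original analysis, and it is meant only to make the notion of "depth of embedding" concrete.

```python
def embedding_depth(proposition):
    """Return how many mental-state operators wrap the innermost content.

    A proposition is either a plain string (depth 0) or a tuple
    (holder, attitude, content), where content is again a proposition.
    """
    depth = 0
    while isinstance(proposition, tuple):
        _holder, _attitude, proposition = proposition
        depth += 1
    return depth


# The embedded-intention example from the list above, in the hypothetical encoding.
parents_intention = (
    "parents", "intend",
    ("children", "intend", "children follow parents into the woods"),
)

# The reader's belief about the Cock's belief, from the discussion of critical beliefs.
reader_belief = (
    "reader", "believes",
    ("Cock", "believes", "foxes like to eat cocks"),
)

print(embedding_depth(parents_intention))
print(embedding_depth(reader_belief))
```

A count of this kind addresses only one dimension of difficulty. It says nothing about conflicts among plans, cultural presuppositions, or how explicitly the text signals the embedded states.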

g. Conclusion -- Implications for Education

It would be presumptuous to suggest revisions in education on the basis
of an interactive plans analysis of a handful of children's texts. After
all, the variation among texts could be greater than that suggested here;
the complexities that have been identified have only tentatively been shown
to cause specific problems in comprehension; and the method of analysis is
itself new and untested. Nevertheless, there are clear, albeit tentative,
signs that have emerged from the analyses we have done.

The comments below are separated into two overlapping categories:
issues related to texts and issues related to learning to read. The first
set applies to decisions about selection of texts for beginning readers,
for tests of comprehension, and for workbooks and similar "school texts."
The second set discusses teaching methods, testing, and the development of
reading skills.

1) Text selection. First, with regard to text characteristics, the
interactive plans analyses that we have done illuminate a world of phenomena
that are implicitly ignored in the design and selection of texts for use in
school workbooks, tests, primers, and textbooks. Stories such as "Indian
Occupation" are commonly used, although they are very poor as literary
texts, because they offer an opportunity to teach such things as the "oa"
sound. But what effects might frequent exposure to such texts have?

For the fortunate child who has books at home and parents who support
and encourage literary explorations, there may be no harm done and the drill
on a specific skill may be useful. But such a child is not the one with the
greatest reading problems.

Think for a moment of the child who has limited reading experiences
outside of school, who has few picture books, and who does not hear stories
being read. In the early grades, he or she encounters a series of texts that
stress decoding skills. The texts often sacrifice the story line under the
assumption that component skills need to be taught independently. Thus, it is
assumed, story structure can be taught when its time comes; there is no need
to demand high quality stories when one is teaching decoding. Later in
school there are expository texts to read. It is assumed that the child is
already a reader, even though critical reading has been rarely taught, or
even encouraged. The skills the child is supposed to have learned are just
to be "applied" in learning new subject matter. Throughout the elementary
grades, the literary diet of the child fails to exercise the skills of
critical reading. Often the result is that motivation suffers as well as
skill development.

If an interactive plans analysis shows nothing else, it still
demonstrates that understanding even a "simple" fairy tale requires
sophisticated skills. Where are these skills to be learned, if not through
reading (or being read) good texts?

It is only partly facetious to propose a text quality hierarchy of the
following kinds:

a. Texts never seen in school.

b. Texts allowed when the regular work is done.

c. Texts read for a purpose other than learning to read.

d. Texts used to teach reading.

e. Texts used to test reading ability.

f. Texts used to teach specific component skills (often used in
remedial classes).

A reader who gets enough of types a and b will learn to enjoy reading, will
learn that there is a point to reading, and will learn the skills necessary to
read with understanding. These three types of learning exist in positive
feedback loops, as shown in Figure 10.




Figure 10 
Learning to read 

But the child who only sees texts at levels c and d or worse never enters the
loops. He or she is then tested at level e and punished with level f texts
for failure to perform on the test.

The text quality issue interacts with another, complexity. While it is
important to realize that reading is a complex skill, we must not assume that
"simpler is better" with regard to text selection. Complexity is
multidimensional; the best text may be one that challenges the reader on a few
dimensions and allows easy success on others. The attempt to produce a
single-number measure of complexity or difficulty is probably misguided.

Furthermore, when one moves beyond the orthographic and lexical levels of
analysis one finds more and more a tendency for texts to be understandable in
different ways. We suspect that few three-year-olds, upon hearing "Hansel and
Gretel," would understand it as an adult would. Yet their understanding,
though possibly limited, is not wrong in the sense that saying "cat" to the
letters "D - O - G" would be. It is a characteristic of interacting plans
and, we think, in general of the phenomena of literature that one can uncover
multiple meanings, no one of which is wrong, or even unsatisfying. The
best approach may be to give children the best literature (expository or
narrative) that we can find, letting the child, rather than some arbitrary
formula, decide whether it is too complex.

2) Teaching reading. What do interactive plans analyses tell us about
learning to read? One point was broached in the previous section.
Understanding plans in stories is a complex task that may require years of
exposure to high quality texts to learn. Consequently, we should expect
children and adults to understand stories in different ways, simply because
they have had varying amounts of experience. It would not be surprising to
find examples of understanding at each of the following levels to be an
indicator of experience with reading:

a. Isolated sentence understanding -- each sentence is understood but
connections are not made.

b. Islands of understanding -- local connections among sentences are
made but no overall pattern is seen.

c. Limited plans understanding -- basic plans are comprehended, but
not interacting plans.

d. Embedded plans understanding -- full understanding of the
interactions among plans of characters in a story.

An interacting plans analysis also gives some guides for our
expectations about developing readers. First, the complexity of plans means
that readers may understand in different ways, yet still be reading, and
hence, learning to be better readers. Second, the importance of "critical
beliefs" means that readers with different backgrounds may build divergent
interpretations of the same texts. Both of these points need to be considered
seriously when we think of what it means to test comprehension skills.

3. Story Grammar Analysis

This section has two goals. The first is to illustrate how a specific
set of predictions about story memory can be made, using a story grammar
analysis. The second is to raise some critical issues about story
comprehension which have not been directly addressed.

In the past few years, it has become increasingly apparent that models
of single word or sentence comprehension cannot account for many of the
important factors affecting the comprehension of discourse material.
Although theories of discourse comprehension must eventually explain how
these smaller units influence the comprehension of an entire passage, an
approach describing the semantic relationships between sentences is
necessary.

In an attempt to extend Bartlett's (1932) work on story memory, Propp's
(1958) morphology of the folktale, and Rumelhart's (1975) initial scheme for
stories, several story grammars have been constructed to describe the
structural basis of story understanding. A major theoretical assumption of
these grammars specifies that memory for stories is a constructive process,
resulting from the interaction between incoming information and pre-existing
cognitive structures, containing knowledge about the generic characteristics
of stories. These structures or schemata, defined as a set of rewrite rules,
influence the way in which a listener will break down incoming story
information into its component parts. Thus, schemata aid the listener by
specifying the types of information and the types of logical relations which
should occur at various points in a story. It is then assumed that the
listener can determine whether any necessary information has been omitted or
whether the correct logical relations have been used to connect the various
story components. Thus, the major assumption underlying the description of a
story schema is that comprehension of a story involves the use of an ideal
story structure to reorganize, represent, and retrieve incoming information.
When text structures do not conform to the rules specified by a story schema,
then readers or listeners will attempt to transform the incoming information
so that a representation adhering more to the structure of an ideal story
schema can be constructed.

In several studies on story comprehension, Stein and Glenn have described
a story schema in detail and have presented evidence to support several
hypotheses concerning the validity of a story grammar (Stein, 1978; Stein &
Glenn, 1977; Stein & Nezworski, 1978). These results will be summarized by
presenting a description of the types of story recall that should occur if a
narrative such as Babar Loses His Crown were to be presented to adults and
children. In order to understand how a schema interacts with incoming story
information, a brief description of the process of analyzing a simple story
into its component parts is presented. Although BABAR is a complex multiple
episode story, the example of a simple story breakdown is representative
of the process involved in organizing information from complex stories such
as BABAR.

According to the Stein and Glenn (1977) grammar, the primary unit
of analysis in a story is the category, and several categories occur within
a story structure. Each category refers to a specific type of information,
and serves a different function in the story. Normally, each sentence in a
story can be classified into a particular category. However, the sentence
is not the critical variable defining category membership. There are
instances in which the initial part of the sentence belongs to one category,
and the latter part to a second category, depending upon the functional role
each portion plays in the story. It should also be emphasized that a story
category can contain one statement or several such statements.

A story structure can be described in terms of a tree diagram which
is a hierarchical network of story categories and the logical relations
which connect them. The initial division of a story consists of two parts:
a setting plus an episode structure. The setting begins the story with
the introduction of a protagonist and normally includes information about
the social, physical, or temporal context pertaining to the development of
the episode. The setting is not part of the episode, as it is not directly
related to the subsequent behavioral sequence described in the episode.
However, information in the setting category may constrain the possible
types of behavioral sequences which then occur.

The remaining story information in the episode consists of a sequence
of five categories: initiating event, internal response, attempt,
consequence, and reaction. The initiating event category contains some type of
event or action which marks a change in the story environment. The major
function of this change is to evoke some type of response from the
protagonist, which is defined as the internal response category. Internal
responses can include goals, affective states, and cognitions, and serve to
motivate a character's subsequent overt behavior. Actions which describe
this overt behavior are defined as attempts. A character's attempts then
result in the direct consequence of a character's actions, marking the
attainment or non-attainment of the character's goal. The final category
is the reaction, which can include a character's internal response to the
consequence or broader consequences caused by the goal attainment. If the
relationship between the setting and episode is ignored, it is apparent
that each category logically follows the preceding one. Furthermore,
according to the grammar, these categories always occur in a specific
temporal order.
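
The category sequence just described can be made concrete with a small
sketch. This is only an illustration, not part of the grammar itself: the
function name parse_story, the category labels, and the toy story are all
invented here, and the sketch assumes a story with a single episode whose
statements have already been labelled by hand.

```python
from typing import Dict, List, Tuple

# Canonical order of episode categories, as listed in the text.
EPISODE_ORDER = [
    "initiating_event",
    "internal_response",
    "attempt",
    "consequence",
    "reaction",
]


def parse_story(labeled: List[Tuple[str, str]]) -> Dict:
    """Group (category, statement) pairs into a setting plus one episode.

    Raises ValueError when a statement violates the canonical order.
    A category may hold one statement or several.
    """
    story = {"setting": [], "episode": {c: [] for c in EPISODE_ORDER}}
    last_rank = -1  # rank of the most recent episode category seen
    for category, statement in labeled:
        if category == "setting":
            if last_rank >= 0:
                raise ValueError("setting must precede the episode")
            story["setting"].append(statement)
            continue
        if category not in EPISODE_ORDER:
            raise ValueError(f"unknown category: {category}")
        rank = EPISODE_ORDER.index(category)
        if rank < last_rank:
            raise ValueError(f"{category} occurs out of canonical order")
        last_rank = rank
        story["episode"][category].append(statement)
    return story


# Toy story, invented purely for illustration.
toy = [
    ("setting", "Once there was a fox who lived near a farm."),
    ("initiating_event", "One day the fox smelled a fresh pie."),
    ("internal_response", "He wanted the pie."),
    ("attempt", "He crept up to the window."),
    ("consequence", "He grabbed the pie."),
    ("reaction", "He felt very pleased."),
]
story = parse_story(toy)
print(story["episode"]["attempt"])
```

A text whose statements arrive in a different order is rejected by this
sketch; a human reader, of course, does something more interesting with
such a text than reject it.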

There are several factors which alert a reader or listener to the
fact that one category has ended and another one begun. Temporal markers
such as "One day," "Suddenly," "Finally," etc. often signal the beginning
of a new category, facilitating the breakdown of stories into components.
The semantic content of a statement and the relationships among statements,
however, are just as important in determining the division among categories.

From the previous description, it is evident that certain types of
information are always contained in the internal representation of an
episode and that the temporal order of category information and the logical
connections between categories are also critical components of a story
schema. By using the rules described in the grammar, a set of predictions
has been made about the nature of story memory, with specific reference to
story recall. Although recall does not guarantee an isomorphic
correspondence to the underlying representation of story information, it does
allow an initial assessment of the types of story information thought to be
most critical in maintaining the semantic cohesiveness of the text of a story.
Therefore, the next two sections will discuss predictions concerning the
types of information most frequently recalled, the types of new information
which will be added to a story, and the order in which story information
will be recalled.

a. Probability of Statements Occurring in Recall 

In past studies, it has been argued that several factors influence
the probability of recalling individual story statements. A story statement,
in most instances, is equivalent to a single sentence in the text of a story.
The sentence, however, is not necessarily the critical component for defining
whether information can be classified into one or two statements. Rather,
it is the function of the information within the context of a story that is
the critical determinant of the unit of analysis. Stories are basically
concerned with goal-oriented behavior and ideally consist of a sequence of
statements directly related to the attainment of the goal. Therefore, the
types of logical relationships existing among story statements in an
episode are the critical factors in predicting the saliency of individual
statements in recall. In the story studies, accurate recall is defined as
the production of statements containing an extremely close correspondence
to the semantic content of the original story material. The syntactic form
of story memory is less important. If the relationships among statements
are directly causal in nature and are related to the character's major goal
attainment, then they have a high probability of being recalled. In recall,
then, certain story statements assume a more important role than other
statements.

Two additional factors are important in predicting the saliency of
each story statement. The first factor concerns the semantic content of
the statement. Although two statements may be causally related to one
another, the information in the first statement may directly imply the type
of information in the second statement. In this situation, the recall of
the second statement becomes unnecessary or redundant. A second factor
concerns how well a particular story statement matches the type of
knowledge acquired about the specific sequence of events being presented. Often
two statements in a story will again be causally related to one another,
but the listener will recall a statement that is an integration of both
statements, or a statement which contains information from which the actual
story statements could be inferred. Thus, the semantic content of a
statement, as well as the type of relations among statements, affects the
probability of recalling individual statements.

In past studies it has been found that certain categories of
information were better recalled than other categories. The categories most
frequently recalled were major setting statements (characterized by the
introduction of the protagonist), initiating events, and direct consequence
statements. Attempts were in the middle of the frequency distribution, while
internal responses, reactions, and descriptive settings were infrequently
recalled. These results seem to suggest that statements within these
latter categories are either semantically redundant or not directly
related to the protagonist's attempt to attain a specific goal.

The saliency of story information is also related to the organization
of story information into higher order units. The episode is the main
psychological unit in a story structure. Just as there are different
types of relationships among statements within an episode, there are also
different types of relations linking the episodes of a story. The
relationships among episodes also play a critical role in determining whether
story statements will be recalled. In many stories, such as BABAR, there
is one overriding goal stated in the first episode of the story. The
remainder of the story, then, consists of a number of episodes containing
subgoals that are directly related to the protagonist's desire to attain
the major goal. If an episode contains subgoals directly related to the
major goal, it should be well recalled. However, there may be episodes
contained in a story which have only an indirect relationship or no
relationship to the major goal. Because these episodes are "empty" in the
sense of being unrelated to the goal, they serve little purpose and are
readily forgotten.

b. Temporal Organization of Stories 

The next set of predictions derivable from story grammar analysis
concerns the variables which regulate the ability of a listener or reader to
recall the correct temporal sequence of a given story text. Two categories
of results will be given.
1) Within-episode sequences. If the temporal organization of a story
text corresponds to the structure described by the grammar, subjects will
have little difficulty organizing incoming information and recalling the
temporal sequence of a story text. The data from several studies strongly
support this result. Even children as young as four can recall the correct
temporal sequence of a story text. However, when the text sequence diverges
from or violates the order specified by an ideal form, listeners have
difficulty maintaining the exact order of the text and reorganize the story
text in several different ways. The reorganization of the text is
representative of a tendency to recall information in an idealized form
corresponding more to the structure described in the grammar than to the
sequence given in the text material.

The type of reorganization that subjects will use during recall depends
upon: 1) what type of story information occurs out of sequence in the text
structure; 2) the text distance of the information from the hypothetical
location prescribed for it in an ideal structure; and 3) the relationship of
the moved information to the new surrounding story information. If the story
sequence in a text is altered by simply reversing the positions of two
adjacent statements, subjects recall the story in one of two ways: they either
reverse the two statements so that the order resembles that described in an
ideal structure or they insert a causal or temporal connector to signal the
fact that an inversion has occurred. When the information in an experimental
text is moved more than one location away from its 'normal' position in an
ideal sequence, different strategies are used. In this case, subjects either
interpolate extra material to connect the moved information more
appropriately to surrounding information or they delete the moved
information altogether.
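
The distinction between an adjacent reversal and a longer displacement can
be sketched in a few lines. This is a simplified illustration only: it
looks just at text distance (factor 2 above), ignores the type of
information moved and its new surroundings, and the names
describe_violation and PREDICTED_STRATEGIES are hypothetical labels
introduced here.

```python
from typing import Dict, List


def describe_violation(ideal: List[str], presented: List[str]) -> str:
    """Classify how a presented statement order departs from the ideal order.

    Returns "ideal", "adjacent_reversal", or "distant_move".
    Assumes both lists contain the same unique statement labels.
    """
    if len(set(ideal)) != len(ideal) or sorted(ideal) != sorted(presented):
        raise ValueError("orders must contain the same unique statements")
    if ideal == presented:
        return "ideal"
    # Positions at which the presented text differs from the ideal order.
    diffs = [i for i, (a, b) in enumerate(zip(ideal, presented)) if a != b]
    if len(diffs) == 2 and diffs[1] - diffs[0] == 1:
        return "adjacent_reversal"
    return "distant_move"


# Recall strategies reported in the text for each kind of violation.
PREDICTED_STRATEGIES: Dict[str, List[str]] = {
    "ideal": ["recall in the presented order"],
    "adjacent_reversal": [
        "restore the ideal order",
        "insert a causal or temporal connector",
    ],
    "distant_move": [
        "interpolate extra connecting material",
        "delete the moved information",
    ],
}

ideal = ["initiating_event", "internal_response", "attempt",
         "consequence", "reaction"]
swapped = ["initiating_event", "attempt", "internal_response",
           "consequence", "reaction"]
moved = ["initiating_event", "attempt", "consequence",
         "internal_response", "reaction"]

for order in (ideal, swapped, moved):
    kind = describe_violation(ideal, order)
    print(kind, "->", PREDICTED_STRATEGIES[kind])
```

The lookup table only restates the reported strategies; it makes no claim
about which of the two strategies a given subject will choose.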

2) Episode organization. In many stories, there are two or three
episodes which are sequenced in an arbitrary fashion such that there is no
a priori reason that one episode should occur before another episode. There
is no direct causal relationship linking the two episodes. In this case,
the probability of a subject's maintaining the correct temporal order of the
episodes in recall decreases markedly. Thus, the type of connection linking
episodes not only predicts whether an episode will be recalled, but also
predicts the order in which episodes are recalled.

c. Current Problems in Studying Story Comprehension

Although preliminary results are very promising for the use of a story
grammar to understand parts of the comprehension process, the important
work in this field is in an initial stage. There are several issues which
still need much more investigation.

1) Encoding. To date, most of the studies completed on stories have
used only recall procedures. While recall is important in assessing memory
in terms of retrieval processes, this type of methodology does not directly
answer questions about the encoding process or the actual process of
representation. Again, one of the major assumptions underlying memory for
stories is the proposed interaction between incoming information and
pre-existing operations and knowledge structures; it is not clear at the
present time how the interaction of these variables differs during encoding
and retrieval. For example, Stein and Nezworski (1978) have shown that
subjects recall stories in a very specific type of temporal sequence and will
transform incoming information so that the order of output matches the
order described in an ideal story structure. However, these
transformations may occur only as a function of the constraints placed on working
memory during retrieval. The underlying representation of the story may
be more complex and more representative of the pattern of incoming
information. Both Mandler and Johnson (1977) and Stein and Nezworski (1978)
have stated that although there may be similarities in the process of using
schemata during encoding and retrieval, there are also significant
differences.

One method of showing the differences between encoding and retrieval
processes is the use of recognition and probe procedures. Stein and Glenn
(1977) have already demonstrated that certain types of story information
(internal responses), infrequently recalled by children, are very
accurately encoded (when probe techniques are used to assess comprehension).
Stein and Nezworski (1978) have also shown that although subjects
reorder stories containing violations of an ideal form, they are aware of the
types of violation occurring in the structure of a text. Thus, during the
process of encoding stories subjects incorporate much more and different
information about the text structure than they are able to retrieve.

2) Inferences. The difference between the encoding and retrieval of
stories also raises questions about the actual structure of the
representation of stories. One of the more important issues related to the
comprehension of stories concerns the types of inferences made during the process
of encoding and representation. Story information often deals with moral
dilemmas where children continually make inferences about the acceptability
of a character's goals, plans, and attempts to attain the goal. That is,
inferences are spontaneously made about whether or not a set of actions is
good or should have been performed and whether or not the character is seen
as good or bad. Inferences are also made about a character's personality
traits, affective states, and perceptions about some of the story events.

Although stories sometimes explicitly state this type of information,
more often than not the reader must make inferences about this information.
Because these inferences are related to the way in which children apply
story information to their own problems and behavior, it becomes critical
to begin a more detailed investigation of this type of information.

3) Semantics. The more general problem with existing story grammars
is that they do not provide a method for understanding the resulting
representation of the specific semantic concept of a story. Although general
predictions in the Stein and Glenn grammar were made about information
salience in recall, based upon the relational structure created among story
statements, these predictions do not concern the specific content of the
story material. The development of a more encompassing model is necessary,
laying out specific predictions about the recallability of each specific
story statement. Although we can show that causal relations among
statements and their relation to the character's goal are important, these
factors do not allow enough specificity about recall.

A final issue which needs investigation concerns the changes which
occur in story structures as a function of age. In one study on story
production (Stein & Glenn, 1977), it has been shown that story length and
the complexity of the story structure change as a function of age. Young
children do not produce certain types of stories (e.g., interactive
character episodes, complex goal structures, etc.). It is now necessary to
determine how these different structural characteristics affect comprehension.

4. Analysis of Expository Structure

The types of higher order cognitive structures used during expository
comprehension have not been described in as much detail as those for stories.
Although Meyer (1975) has specified some of the types of information that
occur in expositions, she has not proposed a general set of rewrite rules
which specify the types of information that must occur in certain texts or
the relationships which must occur among the various types of information.
Because there has not been much effort geared to describing the internal
representation of these texts, it is difficult to make an explicit set of
general predictions concerning memory for and comprehension of exposition.

a. Expository Description

Although the structure of an exposition has many unique components when
compared to story structures, there are many similarities between the two
types of structures. In order to illustrate these similarities, we will
take exposition and compare it to the structure of a story. First, in
a "Definition Exposition," there is a major purpose inherent in the text.
That is, the purpose of texts which define a notion is to transmit specific
information about it to a reader. There are several additional assumptions
which need to be made if we accept the initial one.

When information is transmitted to a listener, we must also assume
that there is a purpose for the transmission of such information. Normally,
a reader has a specific set of reasons for wanting to learn about something.
He is not only concerned about the unique descriptive characteristics of
an object, but he is also concerned about the functions and uses of an
object. These factors cannot be ignored in describing the characteristics of
an exposition. There is purpose, directionality, and a cohesive structure
underlying a definition exposition.

In the first part of a definition exposition, labelled the generic
description category, two types of information could be included:

1) the defining features of the object,

2) a similarity continuum, defining the object by comparing
its features to objects presumed to be already familiar
to the reader.

The defining features of the object are a critical part of the text and
cannot be omitted. The representation is defined as a list of features which
characterizes the object and differentiates it from all other objects.

When the purpose of exposition is to provide new information, the second
category of information, the similarity continuum, is often introduced. One
of the major purposes of this category is to enable the reader to use
pre-existing knowledge structures to encode and construct a representation of
the new information. By providing information which shows both the
similarities and differences to knowledge already acquired about other objects, a
new representation can be constructed which incorporates some information
from other structures but not all of the information. Thus, not surprisingly,
this portion of our hypothesis bears similarities to the processes and
representation proposed for the comprehension of a metaphor (Miller, 1979).
The types of structures proposed for encoding expository information are
also similar to those proposed in Rumelhart and Ortony (1976).

The inclusion of the second category of information may be dependent
upon how much information the reader already has acquired about similar
types of objects. If the subject's pre-existing knowledge is extensive,
then inclusion of this category in the text may be necessary so that the
defining features attributed to the object can be instantiated correctly.
If similarity-differentiation information is not included, subjects may have
a tendency to overgeneralize from the knowledge they already have, and
incorrect features may be attributed to the object. If pre-existing knowledge
is not extensive, then the defining features may be remembered without this
type of information.

The relationship or logical link between the first two categories can
be characterized as an enabling relationship. The description of the
defining features ENABLES the reader to initiate a search for knowledge or
information similar to or different from the specific entity being described.

The third category, functional consequence, is initiated as a RESULT
of encoding specific features about an object. In the DESERT text, the types
of consequences which follow from the object description are negative in
nature, signifying the harm that can occur to all forms of life without the
existence of water. However, in many texts, the type of object being
defined can also be conceptualized as first having positive consequences
associated with its function. The type of consequences first introduced in an
exposition should be a function of the knowledge already acquired about
similar types of objects and their defining attributes. These predictions
assume that readers automatically make inferences about either positive or
negative functions of an object. That is, there is a valence implied in the
description of generic attributes, and the quality of this valence will
determine whether positive or negative consequences are stated first.


If a negative consequence category is given, the category following this
should be a PREVENTATIVE ACTION category. The inclusion of this category
assumes that the listener has specific sets of expectations such that he
desires to know about a specific type of end state when acquiring new
information about an object. This assumption is similar to that proposed by Heider
and discussed by Abelson, Aronson, McGuire, Newcomb, Rosenberg, and Tannenbaum
(1968). Thus, the inclusion of negative consequences should cause the reader
to infer preventative actions to neutralize the negative consequence state.

The fifth category, positive or neutral consequences of an object
function, is the last category in this structure. The relationship of this
category to the previous one is an AND relation. It is assumed that the
statement of negative consequence and preventative actions does not CAUSE or
ENABLE the statement of positive consequences. Thus the relation between the
two consequence categories must be defined as an AND relation. It should
be noted, however, that the structure of knowledge about objects probably
contains information about both types of consequences; and there may be an
underlying set of relations, not specified here, which allows the reader to
make spontaneous connections between the two opposite types of states.
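
The five categories and the relations named between them can be collected
into a small table, sketched below. The identifiers (Link,
DEFINITION_EXPOSITION, relation_between, and the category labels) are
hypothetical names chosen for this illustration, and the sketch covers only
the case in which negative consequences are stated first, as in DESERT. The
text names no relation for the step from negative consequence to
preventative action, so that link is left unlabelled rather than guessed.

```python
from typing import List, NamedTuple, Optional


class Link(NamedTuple):
    source: str
    target: str
    relation: Optional[str]  # None where the text names no relation


# Category-to-category links for a definition exposition in which
# negative consequences come first (the DESERT case).
DEFINITION_EXPOSITION: List[Link] = [
    Link("defining_features", "similarity_continuum", "ENABLE"),
    Link("defining_features", "negative_consequence", "RESULT"),
    # The text says negative consequences lead the reader to infer
    # preventative actions, but gives the link no formal name.
    Link("negative_consequence", "preventative_action", None),
    Link("preventative_action", "positive_or_neutral_consequence", "AND"),
]


def relation_between(source: str, target: str) -> Optional[str]:
    """Return the relation label for a link, or raise KeyError if absent."""
    for link in DEFINITION_EXPOSITION:
        if link.source == source and link.target == target:
            return link.relation
    raise KeyError(f"no link from {source} to {target}")


def categories_in_order() -> List[str]:
    """List each category once, in order of first appearance."""
    seen: List[str] = []
    for link in DEFINITION_EXPOSITION:
        for name in (link.source, link.target):
            if name not in seen:
                seen.append(name)
    return seen


print(categories_in_order())
print(relation_between("defining_features", "similarity_continuum"))
```

Keeping the unnamed link as None makes the gap in the description visible
instead of hiding it behind an invented label.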

B. Discourse Properties



This section describes a number of text-level discourse properties
whose characteristics may affect reading comprehension.
Part 1 consists of a general discussion of information structures which
are posited as scaffolding for the erection of a model of the world of the
discourse in a representation of the comprehension process. Topic-comment
organization and story grammar organization are both treated, and
discussion is included of their relation to each other and of their relevance to
different kinds of texts.

Part 2 is a brief survey of some of the rhetorical devices used in
texts to emphasize and de-emphasize, or foreground and background, various
kinds of information. A variety of syntactic devices are discussed
which reflect the narrator's or a character's point of view, or which
reflect the narrator's attempt to influence the reader's perception of events
described.

Part 3 contains discussion of three major text-connective devices:
scene-setting devices, conjunctions, and anaphoric devices.



1. Information Structures

There has been a great deal of interest recently in the notion of
information conveyed by a sentence, how information is conveyed, what its
form is, and whether the structure of a sentence or discourse is dependent
on or reflects the information communicated. It has been postulated that
comprehension of text depends crucially on the ability of the reader to match
up the text with an appropriate organization within which to interpret it.
In this section we examine a number of the terms used in the description of
information structure schemata, and evaluate the feasibility and utility of
applying them to a complete text such as BABAR or DESERT.
Let us begin with the notion topic. Some authors use this term to
mean discourse topic, or what a discourse is about. Other authors use it
to mean sentence topic, or what a sentence is about. Topics are sometimes
considered to be unexpressed propositions which characterize "aboutness" of
a discourse (see Keenan & Schieffelin, 1976). That is, throughout a
discourse, topics are established, and any given sentence will be related to
the current discourse topic or establish a new topic. There do not seem to
be any good tests for this kind of discourse topic, and, as we will see
below in a partial analysis of topic in BABAR and DESERT, the notion
"discourse topic" still needs quite a bit of refining. According to other
scholars, topics are syntactic elements. These topics are either surface
constituents in some languages (see Li & Thompson, 1976), so that the basic
surface structure of sentences is [topic-comment], as in Lahu, Lisu, and
Chinese, or they appear as nodes in a semantic representation that may or
may not be syntactically realized (see Gundel, 1977).

Another term used by some text-structure theoreticians is theme. This
is used variously to mean 1) the point of departure of a sentence, i.e.,
the element that comes first in the sentence (see Halliday, 1967); or
2) those elements in the sentence that are already within the hearer's or
reader's sphere of knowledge, i.e., information that has already been
introduced to the reader or hearer by the writer or speaker (see Daneš, 1974).

Another set of terms is given and new information. Given information
is used to mean the information the speaker believes to be present in the
(short-term) consciousness of the hearer (see Chafe, 1976), or what the
speaker believes to be recoverable from previous discourse (see Halliday,
1967).
Chafe (1976) proposes as a distinct term, known information, which he
takes to be those elements that are subject to definitization in a sentence.
Kantor (1977) suggests a concept of degrees of activatedness of information
in the consciousness of the hearer or reader to explain the degrees of
comprehensibility of various referential expressions, e.g., pronominal,
demonstrative, definite, definite with added descriptors.

A major problem with almost all the studies referred to above is that
they give no methodology for the analysis of texts in general. Indeed,
the analyses used to exemplify the theories are usually restricted to
critical texts composed of between two and six sentences. As a result, we find
the various constructs to be difficult to apply to actual texts like BABAR
and DESERT.

To illustrate, let us take the notion of discourse topic. The notion
itself seems quite transparent, i.e., that the discourse is at any point
about something, someone, or some event. But notice what happens when we
try to apply this notion to our texts:

Title: Babar Loses His Crown 

1. The Babar family is going to Paris.

(What is the discourse topic? Perhaps that the family is going
to Paris.)

2. Everyone is packing bags. 

(This sentence describes what is going on in the illustration. 
Perhaps the topic is what the family does to prepare to go to 
Paris. But this does not seem very satisfying.) 

3. Here are the children--Pom, Flora, and Alexander.

4. Here are Cousin Arthur and his friend Zephir, the monkey.

(Introduction of actors; the discourse topic might be
characterized as 'the Babar family' or 'who everyone is'.)

5. Queen Celeste and King Babar pack their crowns.

(More introduction; the topic is 'who everyone is and what they
are doing towards getting ready for the trip'.)

6. Babar puts his crown in a little red bag.

(Topic is 'what Babar is doing towards preparing for the trip'.
The most interesting fact about this sequence is that it focusses
on Babar's crown and the little red bag, which becomes very
important to the story later on. Thus, the function of the
sentence, i.e., to foreshadow the future action, is what is important
here. The topic is not terribly revealing.)

7. Now the Babar family is on the train.
(Topic is perhaps 'the Babar family's trip'.)

8. The train is coming into Paris.

(Topic is still 'the trip to Paris', with perhaps a subtopic
'travelling by train'.)

9. "I will show you everything," says Babar.

10. "You will love Paris."

(What is the topic here? Perhaps still 'the train trip to Paris'?)

11. Now they are off the train with all their bags. 

(Sets a new scene; perhaps topic is still 'the trip to Paris'.) 

12. The Babar family is waiting for a taxi. 

(This is a continuation of the action; the topic might be charac- 
terized as 'the family in Paris'.) 

We could go on through the entire story with these rough descriptions
of what the sentences are about. However, all we would have is what the
sentences are about, according to our intuitions, without any methodology
to tell us how to direct such intuitions. Furthermore, it would not tell
us anything revealing about how this notion contributes to the process of
constructing a coherent interpretation of a text.

Rather than a strictly intuitive approach, we could perhaps employ a
methodology of 'filling in' the missing information which, it might be
maintained, will provide the topical connection between sentences. For
example, we might try to turn the story into one or more long sentences:

1. The Babar family is going to Paris (and toward this end,)

2. everyone is packing bags.

This will become awkward to do, however, when we encounter direct discourse,
as shown here:

7. Now the Babar family is on the train, (and as for this train,)
8. the train is coming into Paris, (and with respect to Paris,)

9-10. "I will show you everything," says Babar. "You will love Paris."

Now while such marking of the relationships is possible, we are faced with
at least two problems. First, we must have a set of criteria whereby the
marking of the relationship is done by some specified procedure, so that any
investigator will arrive at the same relational phrase. Secondly, we will
want to be sure that the relationships that we are marking are relationships
of topic, rather than something else, say, logical relationships between two
sentences. Finally, we must also be able to indicate that sentences like
Babar's direct discourse in 9 and 10 are somehow subsidiary to the main topic
of 'the family going to and arriving in Paris'. For this we need at least
a hierarchy of topics and subtopics. We lack a methodology for performing
such an analysis at present.

An analysis of discourse topic in an expository source like DESERT is
somewhat more intuitively satisfying, as shown in the following fragment:

1. What is a desert?

(This sentence sets the major topic, 'the desert'.)

2. You think at once of a place without water.

3. You would think that the desert does not have enough rain for
things to grow.

(The topic here is 'what is generally felt about deserts'. The
author is using these sentences as a device for introducing his
main thesis, which is . . .)

4. Yet there is life in every desert.

(This sentence quite clearly is the topic sentence of the text.)

5. Plants, animals, and people live there.

(This sentence has a topic 'there is life in every desert'. It
sets the structure of the text to come, i.e., it introduces the
subtopics of 'plants, animals, and people in the desert'. This
sentence is a statement of support for sentence 4.)

6. How can this be?

7. It is a wonderful story.

(These sentences clearly relate to the topic 'there is life in
the desert,' but how, exactly, we have not yet been able to
characterize.)
NEW PARAGRAPH

8. In the desert it may not rain for five years.

9. Then one day a storm comes.

10. A heavy rain falls.

(All three of these sentences should probably be taken as one
topical unit, setting the scene for a new subtopic, 'plants in
the desert.')

11. All at once the desert is covered with green grass.
(Topic is 'how plants manage to live in the desert.')

12. You can see many small flowers.
(Parallel to sentence 11.)

13. They grow very fast.

14. In one week they go from seed to flower and back to seed.
(Elaboration on sentence 12; the subtopic here is 'flowers
in the desert.')

15. Then the seed lies in the sand, perhaps to wait another five years
for rain.

(This sentence continues the action from 14; the topic is still
'flowers in the desert,' but it also makes a back-reference to
the beginning of the paragraph, sentence 8, to tie up the
paragraph quite nicely.)

The analysis of DESERT in terms of topic is somewhat more satisfying,
for we can more readily identify what each of the sentences is about and
how they relate to the exposition as a whole. Does this mean that the
notion topic is applicable to descriptive discourse, but not to narrative
discourse? Possibly. Accepting an affirmative answer to the question means
that accepting topic as a linguistic construct entails that we take the
view that different genres or text-types are constructed of different kinds
of basic constructs. In either case, a non-intuitive methodology is still
necessary before this sort of analysis can be taken seriously.

It might be supposed that an analysis in terms of thematic (or given) 
and rhematic (or new) information might be more insightful for narrative 
texts such as BABAR. Some theories, especially those of the Prague School 
linguists, e.g., Daneš (1974), propose that we can identify those elements 
in a sentence that are old or given or thematic information as those 
elements that can be assumed to be known to the reader from previous context. 
Following are the results of an attempt to apply this notion to the first 
five sentences of BABAR. Old (known, given) information is represented in 
brackets. 

Title: Babar Loses His Crown 

1. The Babar family is going to Paris. 

2. Everyone is packing bags. 

(Here, it is debatable whether 'everyone' should be marked as 
known information. If we make the assumption that the Babar 
family consists of individuals, then we might say that one could 
infer the existence of these individuals, and so, 'everyone' is 
in some sense known to the reader. On the other hand, the reader 
does not necessarily know that it will be the entire family that 
is packing, and so the known information might be claimed to be 
the semantically ellipsized 'of them.' Had the sentence been 
'Every one of them is packing bags,' then we would have marked 
'them' as known.) 

3. Here are the children--Pom, Flora and Alexander. 

4. Here are Cousin Arthur and {his} friend Zephir, the monkey. 

(What about the phrase 'the children'? Given 'family,' one can 
infer "possibility of children" at least. So, in a sense, "the 
children" might be characterized as 'known,' although there is 
no prior mention of them. In sentence 4, we can mark 'his' as 
known, since its referent has just been introduced.) 

5. Queen Celeste and King Babar (pack) {their} crowns. 

('Babar' is mentioned in the title of the story. It might appear 
that we should then mark this element as "old," but we can also 
make the case that having read the title, the reader does not 
necessarily know who Babar is, or what Babar is. Therefore, we 
do not mark this element as old information. The possessive 'their' 
is marked for the same reason as 'his' in sentence 4. The verb 
'pack' may be marked as some sort of old information, since the 
reader knows from sentence 2 that everyone is packing bags. But 
verbs, unlike nouns, do not refer. This immediately raises 
difficulties of analysis, for the term "information" is a rather 
nebulous one. Does it refer to concepts, or propositions, or what? 
There is clearly some added cohesiveness to the story with the 
knowledge that Babar and Celeste are packing, since the reader 
knows that everyone is packing, so perhaps it is reasonable to 
consider the notion of "packing" as old information. But what is 
the nature of this old information? What does this claim commit 
us to saying about the representation of linguistic structure or 
about the psychological processes of the speaker?) 
As we hope is evident, the characterization of terms like old or given 
information and the methodology for their assignment to text elements are 
entirely inadequate to the task of providing a reproducible analysis of a 
complete natural text. The whole notion of old information was proposed to 
account for the fact that in a number of Slavic languages and other languages 
of the world, word order that had been thought to be free (i.e., major 
sentence constituents could be arranged in most any order) could be seen as 
constrained in discourse by the character of the information communicated, 
according to the principle 'old information first, new information last.' So, 
for example, sentence 5 of our discourse when translated into a Slavic 
language like Czech might have had the constituent order Verb-Subject-Object, 
since 'pack' is termed "old information." But in a language like English, 
where the restrictions on word order are much more stringent, it is not at 
all clear what the concept of old or known information can contribute to our 
understanding of the structure of sentences, the structure of texts, the 
structure of English, or the structure of knowledge. 

Perhaps if the notion of old information were clarified and refined, we 
might find that the ratio of new information to old information in stories 
should be lower for one level of reading ability than another, but with the 
present state of the art, we find ourselves incapable of even beginning such 
a study. Most linguists believe that there is something to the concepts of 
topic and information structure, and we would propose continuing research 
in this area with an eye toward tightening up definitions and methodology. 

2. Rhetorical Devices 

a. Introduction 

This section contains discussions of a number of syntactic devices 
exploited in texts to influence the way information contained in the text 
will be appreciated. Some of these, like the various (usually preposing) 
devices which in narrative texts indicate that the narrator ascribes 
particular importance to the events or individuals described, may help the 
reader to anticipate events or types of events, and reconstruct the 
structure of the text. Others, such as rhetorical questions and interjections 
like "Poor Babar!" which have forms which indicate that the narrator is 
addressing the reader personally, may have motivational functions. All of 
these speculated functions are amenable to experimental verification, the 
results of which could be applied almost immediately in the writing and 
editing of basal readers and other texts intended for reading instruction 
and reading improvement. 

b. Some Examples from a Narrative Text 

There are a number of places in BABAR where the author shifts from his 
standard detached and unbiased narration. Tagging each of these with labels 
would be premature, since the data to be discussed are mostly one of a kind. 
Further study of real and made-up texts needs to be made to see if any 
generalizations will obtain. Here, however, are some of the passages and 
some rough characterizations of them which identify devices meriting further 
attention. 

(93-98) But Babar can't eat. He is thinking about his 
crown. He needs it tonight. He must wear it to the 
opera. He fears his crown is gone forever. 

The author is clearly speaking as if from inside the character's head here. 
The third and fourth sentences especially might well have been quotes, i.e., 
Babar thinks, "I need it tonight. I must wear it to the opera." Nowhere in 
BABAR is there any thought-quotation. This seems to come closest. 

(S^i-SB) The Babar family circles around him; Babar says, 
"Ahem!" The man looks up. He is not the Mustache-man at 
all. "Oh, excuse us," says Babar . . . 

The third sentence is interesting because of the phrase at all. Again, the 
narrator is speaking from the protagonists' point of view. The phrase at 
all is oral syntax, and reflects the expectations of the speaker. A more 
neutral description of the scene would be accomplished by something like 
He is not the Mustache-man or He turns out not to be the Mustache-man. 
Linguistic forms which reflect speaker attitude are pervasive, even in 
English. Analysis of the rhetorical exploitation of such forms would be 
greatly aided by even a preliminary dictionary of such forms. 

A final example: 

(115-116) Poor Babar! His crown is lost again. 

This is clearly a plea directly from the narrator for sympathy for Babar. 
Also, the use of the word 'lost' reflects his taking Babar's point of view 
in a subtle way. Previously, 'lost' was used only in direct quotation of 
Babar. Here, instead of describing Babar's thoughts, as he previously did 
in sentence 98, "He fears his crown is gone forever," he takes Babar's point 
of view, and simply speaks of the crown as missing, as if it were his own. 

3. Connective Devices 

This section contains discussions of three important elements which 
function to weave the threads of discourse in a text into a coherent entity. 
These are 1) scene-setting sentences, which serve to mark the boundaries of 
units of text defined by physical location of characters; 2) conjunctions, 
which specify relations between the clauses they introduce and other clauses 
or larger chunks of text; and 3) anaphoric devices, which relate references 
to individuals within the text, subject to a set of often subtle constraints 
which interact in ways that are not yet entirely understood. 

All of these devices may affect comprehension by making it more or less 
difficult for the reader to reconstruct the structure of the text, so that he 
or she will be able to perform such processing tasks as are necessary for a full 
appreciation of the text, e.g., forming expectations as to what will come next, 
picking out relevant details in what he or she is processing or has just 
processed, etc. Once it has been ascertained exactly how the devices reflect 
and signal information about structure and other discourse-level properties, 
it will be possible to evaluate texts with regard to them, and to establish 
how much of a factor they actually are in determining comprehension. This 
done, there may be implementable implications for the writing and editing of 
texts at all levels. 

a. Scene Setting 

In this section, we examine the syntactic and lexical markers of 
discourse structure which signal scene-setting sentences. The purpose of such 
sentences is to locate the new scene or episode in time or space. To see 
what kinds of generalizations could be made about the linguistic form of 
such sentences, we picked out those sentences in a narrative text (BABAR) 
which denoted a new location or time for the main characters, i.e., the Babar 
family. If the story described in detail how the family got to a particular 
location or time, then any sentence mentioning this location or time was 
not included. Following are the sentences that we identified as scene 
setters on the basis of the contribution of their semantic content to the 
development of the narrative: 

7. Now the Babar family is on the train. 
11. Now they are off the train with all their bags. 
17. In their hotel room, Celeste opens all the bags. 
32. So out they go, looking for the man with Babar's bag. 
35. Now they are up in the Eiffel Tower. 
40. A boat is ready to take off. 
68. Arthur is standing in the sun to dry his clothes. 
87. Now it is noon. 
104. The Babar family is in two taxis. 
137. Back in their hotel room, they say goodnight to the children. 
143. They arrive at the big opera house. 

It turns out that many of these sentences have one or more syntactic or 
lexical properties which make them natural candidates for a scene-setting 
function. Sentences 7, 11, 35, and 104, which locate the characters at a 
particular place, are all of the form Now Subject be Prep. Location. In addition, 
sentence 32 contains a directional adverb or preposition which is brought to 
the beginning of the sentence. Sentences 17 and 137 contain what might be 
called thematic adverbials or locatives, preposed phrases in the sentences 
which serve to locate the action in time. Sentences 68 and 143 contain the 
verbs arrive and stand, which might well be classified as scene-setting verbs. 
Notice that either of these verbs can occur after there, as in There stood 
three men on the beach or There arrived a swarm of bees just as the summer 
ended. Finally, we have sentence 87, Now it is noon, which explicitly sets 
the time of the scene. 

Each of these scene-setting sentences, then, appears to have one or more 
markers characteristic of scene-setting. Further research is, of course, 
required before we know whether sentences with these syntactic properties 
necessarily serve a scene-setting function, whether what we have classified 
as scene-setting has any psychological reality, and what other linguistic 
forms may serve a scene-setting function. 
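
As a rough illustration of how the surface markers described above might be checked mechanically, the following Python sketch tags a sentence with the markers it exhibits. The function name, the marker labels, and the word lists are assumptions made for illustration only; they are not part of the analysis reported here, and pattern matching of this kind says nothing about whether a sentence actually serves a scene-setting function.

```python
import re

# Hypothetical word lists; a real study would need principled inventories.
SCENE_VERBS = {"arrive", "arrives", "arrived", "stand", "stands", "standing", "stood"}
PREPS = r"(?:in|on|at|off|up|back|out)"


def scene_markers(sentence: str) -> list[str]:
    """Return labels for the surface scene-setting markers found in a sentence."""
    s = sentence.strip()
    found = []
    # "Now Subject be Prep. Location", e.g. "Now they are up in the Eiffel Tower."
    if re.match(rf"Now\b.*\b(?:is|are)\s+{PREPS}\b", s, re.IGNORECASE):
        found.append("now-be-location")
    # Explicit setting of the time, e.g. "Now it is noon."
    if re.match(r"Now it is\b", s, re.IGNORECASE):
        found.append("explicit-time")
    # Preposed locative or directional phrase set off by a comma,
    # e.g. "In their hotel room, ..." or "So out they go, ..."
    if re.match(rf"(?:So\s+)?{PREPS}\b[^,]*,", s, re.IGNORECASE):
        found.append("preposed-phrase")
    # Candidate scene-setting verbs (arrive, stand).
    words = set(re.findall(r"[a-z]+", s.lower()))
    if words & SCENE_VERBS:
        found.append("scene-verb")
    return found
```

Applied to the sentences listed above, this sketch tags sentence 7 as now-be-location, sentence 32 as preposed-phrase, sentence 87 as explicit-time, and sentence 143 as scene-verb. Sentence 40 receives no tag, which is consistent with the fact that no marker is identified for it in the discussion above.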

b. Conjunctions 

Conjunctions play a crucial role in providing the reader with informa- 
tion both about the story structure and about the story content. We present 
here a brief list of the categories of conjunctions that we have found in 
one narrative text, BABAR, and a discussion of what we believe to be their 
importance to discourse analysis and discourse understanding. 

We omit any discussion of conjoined phrases, e.g., as in the sentence 
Here are Cousin Arthur and his friend Zephir, the monkey, since these do not 
affect the text as a whole and are strictly sentence-internal conjunctions. 
Conjunctions between clauses of the same sentence are included, since the 
two clauses could be considered as separate sentences for analyses on the 
level of individual propositions. 

The most frequent conjunction in BABAR is but. Sentence-initial But 
must be distinguished from the use of but to conjoin parallel clauses within 
a sentence. The latter usage introduces a clause whose content is contrary 
to the expectation that the writer intends to be generated in the reader by 
the preceding clause or clauses within the same sentence. Sentence-initial 
But makes a similar contrast, but not between the following clause and the 
one immediately preceding it, which is in a different sentence, but between 
the following clause and some proposition at a higher level of discourse. 
This proposition may be a stated one, or one which is merely intended to be 
inferred from what is stated somewhere in the preceding discourse. 
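
The positional distinction just drawn can be sketched in a few lines of Python. The function below is a hypothetical helper, not a tool used in this study: it splits a passage naively on sentence-final punctuation and labels each but by its position. It identifies only where the conjunction sits; deciding which higher-level proposition a sentence-initial But contrasts with still requires the kind of inference discussed above.

```python
import re


def classify_but(text: str) -> list[tuple[str, str]]:
    """Label each 'but' as sentence-initial or clause-conjoining."""
    # Naive split on ., ! or ? followed by whitespace (an assumption;
    # quoted speech and abbreviations are not handled specially).
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    labels = []
    for sent in sentences:
        # Ignore an opening quotation mark when deciding what is "initial".
        stripped = sent.lstrip("\"' ")
        for match in re.finditer(r"\bbut\b", stripped, re.IGNORECASE):
            kind = "sentence-initial" if match.start() == 0 else "clause-conjoining"
            labels.append((kind, sent))
    return labels


passage = (
    "Now they are up in the Eiffel Tower. "
    "But the man with Babar's bag is not. "
    "Babar is sad, but he goes along."
)
for kind, sent in classify_but(passage):
    print(kind, "|", sent)
```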

We begin our analysis of some of the connective functions of but in 
BABAR with an instance of sentence-initial But: 

(30-39) i/f/f^'i) "I need my crown!" says Babar. "I must wear it 
tonight!" "Don't worry," says Celeste. "We'll find that Mustache- 
man. We'll look all over Paris till we do." 

So out they go, looking for the man with Babar's bag. 
"He may be up in the Eiffel Tower," says Celeste. "All visitors 
to Paris go up there." 

(##17) Now they are up in the Eiffel Tower. But the man with 
Babar's bag is not. 

But here marks the fact that the family expected to find the Mustache-man 
in the Eiffel Tower and, contrary to their expectations, did not find him. 
The expectation that the man would be there was created by Celeste's 
statement, "He may be up in the Eiffel Tower," and by the fact that the family 
did indeed go up in the Eiffel Tower. (Note that no explicit statement that 
the family went up in the Eiffel Tower for the purpose of finding the 
Mustache-man is made--this must be inferred by the reader.) 

The next occurrence of but follows the previous discourse; i.e., on 
pages 17 and 18 we read: 

(38-42) (##17) Now they are up in the Eiffel Tower. But the man 
with Babar's bag is not. "Look at the boats down there!" the 
children shout. "Let's go for a ride." 

(##18) Babar is sad, but he goes along. 

The interpretation of the clause-conjoining but here requires quite a 
deductive chain. To fully comprehend this sentence, the reader must assume 
that when one is sad, one generally does not feel like doing something 
that is enjoyable, or perhaps that when one is sad, one does not feel like 
being around other people. Now with this as a cultural presupposition, the 
reader must have the world knowledge that taking a boat ride is supposed to 
be an enjoyable thing. An instantiation of the general cultural 
presupposition, then, leads to an understanding of why Babar's going along for the ride 
is contrary to the expectation of the probable behavior of Babar, given the 
knowledge that he is sad. 

The next occurrences are in the discourse from pages 23 through 27. 

(54-70) (##23) The boat does not stop. So Arthur dives off. 
"I'll catch him," he cries. "I'll catch that Mustache-man!" 
(##24) Arthur climbs out of the water just as fast as he can. 
He is all wet. He sees the man with the bag. (##25) Arthur 
calls to him. But the man does not hear him. He is walking 
away. (##26) Arthur runs after him. He waves. He yells, 
"Come back, Mister! You have Babar's crown!" (##27) But 
now the man is on a bus. The bus goes down the street. Now 
the Mustache-man is gone. 

The occurrence of but on p. 25 is simply a contrary-to-expectation 
conjunction, which could have joined the clause containing it to the preceding one 
in a single sentence. The second occurrence, on page 27, is somewhat more 
complicated. But is contrary to expectation, but it is also combined with 
now to signal a change in the temporal setting. This but does not crucially 
refer to the proposition that the Mustache-man is on the bus, but rather to 
an inference that Arthur has failed to get the Mustache-man's attention and 
stop him from going away. 

The next but is different. On page 28 we find: 

(71-75) Arthur is standing in the sun to dry his clothes. 
Suddenly he sees the whole family. They come running toward 
him. "I'm so glad to see you," he says, "But the red bag--it 
got away." 

Here we have a but of contrast. Arthur has stated that he is glad to see 
the family. This is a happy event. Now he must tell the family the sad 
news, that the bag got away. Notice that we cannot view this but as 
contrary to the expectations of the family, for if that were the case, Arthur 
should have been able to use a discourse-initial "But the red bag got away." 
But this sentence would be anomalous in context. Rather, we must see the 
conjunction here as contrasting the happy with the sad. 

It is also very interesting to look at those contexts where but could 
be used and is not, and also those contexts which would be incomprehensible 
without the conjunction. 

We find one case in BABAR where the conjunction but would fit nicely, 
but is not used. This is in the discourse on pages 20 through 23: 



(47-54) (##20) Suddenly Zephir shouts, "Babar, look! Up on 
the bridge! The Mustache-man! He is there with your bag!" 

(##21) The children all shout, "Captain, stop! Stop the boat! 
Let us off!" 

(##23) The boat does not stop. 



But would have fitted quite nicely as the first word of the last sentence 
here. The situation is perfect for a contrary-to-expectation conjunction. 
We might wonder whether the fact that the conjunction is not present hinders 
comprehension. 

In the sentence that we have looked at above, Babar is sad, but he goes 
along, the conjunction must be included. The discourse would be decidedly 
odd if the two clauses were made into separate sentences with no text 
conjunction uniting them, i.e., 

"Look at the boats down there!" the children shout. "Let's 
go for a ride." 

Babar is sad. He goes along. 

The last two sentences seem entirely unrelated without the conjunction. 
Clearly a lot more work needs to be done both on the comprehension of 
sentence connection with and without conjunctions and on the theoretical 
linguistic aspects of when conjunctions are required by the grammar and when 
they are not. This is an almost entirely uncharted area. 

Another interesting kind of text conjunction is one we call a topic 
connector conjunction. The sole example in BABAR is on page 52: 

(134-137) Sadly they come up from the subway. Babar says 
nothing. He is very, very sad. And the children are very 
tired. 

The last sentence here is the one of interest. The reason it is of such 
great interest is that it really does not relate to any of the previous 
discourse. The Babar family has been chasing the elusive Mustache-man all 
over Paris and has lost him once again in the subway. The topic at the 
beginning of page 52 is clearly Babar's feelings. The conjunction and of the 
final sentence serves to unite the final sentence with the previous 
statements about Babar by refocussing the topic from Babar's feelings to the feelings 
of the protagonists in general. Notice that the discourse would sound less 
connected without the conjunction, i.e., 

Sadly they come up from the subway. Babar says nothing. 
He is very, very sad. The children are very tired. 

The function of the conjunction and is perhaps one of forcing the cohesion 
of topic within the same scene. 
Now and then, which are traditionally considered to be temporal adverbs, 
may also be seen as kinds of topic-connecting or topic-establishing 
conjunctions. Now, especially, functions throughout BABAR to convey the 
information that a new scene or new event is about to be introduced. It typically 
introduces a sentence which describes the accompanying illustration. 
Without the illustrations, rather long chains of inferences are required to 
connect half of the sentences beginning with now with the preceding 
discourse. Some examples: 

(5-8) (##3) Queen Celeste and King Babar pack their crowns. 
Babar puts his crown in a little red bag. 

(##4) Now the Babar family is on the train. The train is 
coming into Paris. 

The inference must be made that the Babar family got on the train. 

(36-40) (##14) "He may be up in the Eiffel Tower," says 
Celeste. "All visitors to Paris go up there." 

(##17) Now they are up in the Eiffel Tower. But the man 
with Babar's bag is not. 

Again an inference, that the family went up in the Eiffel Tower, must be 
made. Examples like these occur at the beginning of scenes. 

The other now's are more difficult to analyze. For example, we see on 
pages 26 and 27: 

(64-70) Arthur runs after him. He waves. He yells, "Come 
back, Mister! You have Babar's crown!" 

But now the man is on a bus. The bus goes down the street. 
Now the Mustache-man is gone. 

The first now is a scene-setting conjunction, descriptive of the 
illustration and requiring an inference that the man got on a bus. The second now 
is different. It reflects the point of view of the episode-protagonist, 
Arthur. Whereas the illustration-descriptive, scene-introduction now refers 
to the present relative to the reading of the story (that is, it reflects 
the language a narrator would use if the story were being told or read with 
the illustrations as cues), this now refers to the present time relative to 
the event being described. 

The conjunction then provides us with some interesting properties as 
well. It appears that this conjunction can be used to signal a change or 
break-up in an action sequence. For example, 

(117-123) They get out in front of a market. "I guess we'll 
have to forget about my crown," sighs Babar. So the children 
begin to run and play. They race around. They hide behind 
boxes. 

Then they see another man with a small red bag. All the 
children rush after him. 

Notice that without the conjunction then in this discourse, the sequence is 
incomprehensible, i.e., 

So the children begin to run and play. They race around. 
They hide behind boxes. They see another man with a small 
red bag. 

A parallelism is set up here with sentences of similar structure and length, 
i.e., They race around. They hide behind boxes., and so for the reader to 
be able to appreciate the importance of the fact that the children see 
another man with a bag, the author must mark such information as not on a 
par with the preceding sentences. 

Much more work, both theoretical and experimental, is necessary before 
we will be able to describe when or where this kind of marking is required 
to facilitate comprehension. 

c. Anaphoric Devices 

A major cohesive device in any text is that of anaphoric reference, or 
the reference of a noun or noun phrase to some previous mention of that item. 
The simplest forms of anaphora are those of coreferential nouns or pronouns. 
For example, in 

(5-6) Queen Celeste and King Babar pack their crowns. Babar 
puts his crown in a little red bag. 

Babar in the second sentence refers anaphorically to the phrase King Babar 
in the first. His in the phrase his crown also refers to Babar. These 
anaphoric references are ubiquitous in every kind of text. That is, we 
typically find pronominal references following close behind their referents. 

There are in the texts we examined, however, a fair number of anaphoric 
references that are not so straightforward. In this section, we will give 
some representative examples of these and discuss them in terms of the 
problems such uses of anaphoric devices may pose for comprehension. 

1) Reference analysis. One aspect of discourse that can be studied as 
part of a text analysis program concerns how characters, objects, events, 
places, etc. are first introduced into the discourse and then later referred 
to anaphorically in terms of either a definite pronoun or a definite 
description. Such an analysis can provide the data needed to answer many 
interesting questions about a text, including the following: 

a. What particular assumptions underlie the use of each 
definite description in a text and are those assumptions 
justified? (A definite description may be used either to 
introduce a character, object, etc. or to refer to it 
anaphorically. It is a unique description in that in using 
it, the speaker/writer makes one of two general assumptions. 
One assumption is that there is one and only one discourse 
entity which the listener/reader is already aware of that 
s/he would associate with that description. The other 
assumption is that the listener/reader knows that 
there is (or can be) only one object so describable and creates 
a unique discourse entity accordingly. Given the first general 
assumption, a reference analysis can be used to identify both 
the reason that the speaker/writer assumes a definite 
description will pick out any discourse entity the listener/reader is 
aware of and the reason that s/he assumes that it can 
discriminate among all those so known. Given the second general 
assumption, a reference analysis can be used to identify what knowledge 
of the world or of English the listener/reader must possess in 
order to guarantee such uniqueness.) 

b. To what extent are the characters, objects, etc., separable, 
on the basis of the descriptions given or derivable from the 
text? (Low separability may lead to confusions.) 

c. What is the rate at which discourse entities are introduced? 
(Too many in rapid succession may be too great a load on 
processing and/or memory.) 

d. How much text/narrative intervenes between a discourse 
entity's introduction and a subsequent anaphoric reference to it? 
(If the gap is large, there may be insufficient content to the 
anaphoric expression to find the intended entity.) 

e. At any given point in the text, which discourse entities 
does the speaker/writer assume the listener/reader is focussed 
on such that they can be accessed via the minimal cues of 
definite pronominal reference? If there are several entities 
accessible via the same pronoun, on what basis (contextual 
and/or inferential) does the speaker/writer presume the 
listener/reader can identify the intended referent? (This is the 
pronoun resolution problem discussed at length in the AI and 
cognitive psychology literatures (cf. Charniak, 1972; Winograd, 
1972; and Chafe, 1974, among others).) 
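
Questions c and d lend themselves to simple counts once a text has been annotated by hand. The Python sketch below assumes such an annotation already exists; the tuple layout, the form labels, and the function names are hypothetical, and the sample data are a toy annotation of sentences 5 and 6 of BABAR that treats the excerpt as if it were the start of the text. Nothing here resolves reference automatically.

```python
from collections import Counter

# Hand-annotated mentions: (sentence number, entity label, form).
# The form labels "intro", "noun", and "pronoun" are assumed for illustration.
mentions = [
    (5, "Celeste", "intro"),
    (5, "Babar", "intro"),
    (5, "crowns", "intro"),
    (6, "Babar", "noun"),      # 'Babar' referring back to 'King Babar'
    (6, "Babar", "pronoun"),   # 'his'
    (6, "red bag", "intro"),
]


def introduction_rate(mentions):
    """Question c: how many entities are introduced in each sentence."""
    return Counter(sent for sent, _, form in mentions if form == "intro")


def anaphoric_gaps(mentions):
    """Question d: sentences elapsed between an anaphor and the previous mention."""
    last_seen = {}
    gaps = []
    for sent, entity, form in mentions:
        if form != "intro" and entity in last_seen:
            gaps.append((entity, form, sent - last_seen[entity]))
        last_seen[entity] = sent
    return gaps


print(introduction_rate(mentions))
print(anaphoric_gaps(mentions))
```

Measuring the gap in sentences is only one possible choice; words or clauses intervening might be a better index of processing load, and that remains an open question.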

This section is based on preliminary analyses of both Babar Loses His 
Crown and the passage entitled "Indian Occupation" (page 38 above) which note 
how characters, objects, etc., are introduced contextually and how they are 
later referred to anaphorically. In this initial work, we have focussed on 
the first question above, since any of a wide range of skills and knowledge 
--syntactic, semantic, factual, etc.--may be called upon to justify a 
particular definite description. 
The analysis is based primarily on research reported in Nash-Webber 
(1978a, b). Before describing our method of analysis and its application 
to these two texts, it would be useful to understand some of the fundamental 
assumptions guiding our approach to research on reference. The central 
notion is that of a discourse model. We assume that one objective of 
discourse is to communicate a model: the speaker/writer has a model of some 
situation which s/he wishes to communicate to a listener/reader. Thus, the 
ensuing discourse is, at one level of interaction, an attempt by the speaker 
to direct the listener in synthesizing a similar model. (In this sense, we 
are equating "understanding" with "synthesizing an appropriate model.") 

Informally, a discourse model may be described as the set of entities 
"naturally evoked" by a discourse and linked together by the relations they 
participate in. These are the discourse entities that were mentioned above. 
In order to understand what we mean by the notion of entities "naturally 
evoked" by a discourse, consider the following sentence: 

1. Each 3rd grade girl brought a brick to Wendy's house. 

Then consider each continuation in example 2. In each case, the referent of 
the definite pronoun (e.g., "she," "it," "they") would be an entity 
"naturally evoked" by sentence 1. 

2a. She certainly was surprised. 
She = Wendy 

b. They knew she would be surprised. 
They = the set of 3rd grade girls 

c. She piled them on the front lawn. 
them = the set of bricks, each of which some 3rd grade girl 
brought to Wendy's house 

d. She was surprised that they knew where it was. 
it = Wendy's house 

e. Needless to say, it surprised her. 
it = the brick-presenting event 
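
One way to picture the entities evoked by sentence 1 is as a small table of discourse entities, each with a unique description and the pronoun forms that could pick it out. The Python sketch below is only an illustration under that assumption; the class, the entity labels, and the lookup function are hypothetical and are not the formalism of Nash-Webber (1978a, b).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entity:
    label: str        # hypothetical identifier
    description: str  # unique description of the entity
    pronouns: tuple   # pronoun forms that could pick the entity out


# Entities "naturally evoked" by:
#   Each 3rd grade girl brought a brick to Wendy's house.
MODEL = [
    Entity("e1", "Wendy", ("she", "her")),
    Entity("e2", "the set of 3rd grade girls", ("they", "them")),
    Entity(
        "e3",
        "the set of bricks, each of which some 3rd grade girl "
        "brought to Wendy's house",
        ("they", "them"),
    ),
    Entity("e4", "Wendy's house", ("it",)),
    Entity("e5", "the brick-presenting event", ("it",)),
]


def candidates(pronoun, model=MODEL):
    """Return the descriptions of every entity the pronoun could refer to."""
    form = pronoun.lower()
    return [entity.description for entity in model if form in entity.pronouns]


print(candidates("She"))
print(candidates("it"))
```

Here "She" has a single candidate, while "it" and "them" each have two. Choosing between the two is the work of context and inference: in continuation 2d the house is selected, in 2e the event.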

Now a speaker/writer is usually not able to communicate at once all the 
relevant properties and relations s/he may want to ascribe to any one of 
these discourse entities. That task requires multiple acts of reference. 
When the speaker/writer wants to refer to an entity in his or her discourse 
model, s/he has two ways to do so. One way is with a definite pronoun. In 
using a definite pronoun, the speaker/writer assumes 1) that on the basis of 
the discourse thus far, a similar entity will be in the listener/reader's 
(partially formed) model; and 2) that the listener/reader will be able to 
access and identify that entity via the minimal cues of pronominal reference. 
The referent of a definite pronoun is thus an entity in the speaker's 
discourse model which s/he presumes to have a counterpart in the listener's 
discourse model. 

Alternatively, the speaker may refer to an entity in his or her 
discourse model by constructing a description of it in terms of some or all of 
its known properties and/or relations (e.g., "a definite pronoun or a 
definite description"). 

So while a discourse entity E can be the referent of a definite anaphor
A, we consider A's antecedent to be a unique description of E conveyed to
the listener by the immediately preceding text. The relationship between
the discourse on the one hand and the referents of definite anaphora on the
other is thus an indirect one, mediated by the discourse participants' models.

It is our belief that one can formalize, at the sentence level, rules
for deriving unique descriptions of the discourse entities evoked by a text.
A preliminary set of eleven formal rules which are sensitive to such aspects
of a sentence as how each noun phrase is determined, what the relative scope
is of each quantifier, and what dependencies exist between noun phrases due
to relative clauses can be found in Nash-Webber (1978a, b). As given there,
the rules do not take into account tense, modality, belief and deontic
contexts, and certain aspects of negation, all of which can be shown to be
necessary factors in forming appropriate unique descriptions. However, in
performing our preliminary reference analysis, we have intuitively extended
the rules to cover these aspects as well.

As an example of these rules, consider the following one, RW-1, which
applies to propositions in which a singular existential quantifier (i.e., a
singular indefinite noun phrase) has the widest scope.

    (RW-1) If a proposition $S$ is of the form
               $(\exists x : A) \,.\, P x$
           then it follows that
               $(\exists x) \,.\, x = \iota z : A z \,\&\, P z \,\&\, \text{evoke } S, z$

I.e., informally, if a proposition states that there is a member x of class
A for which P is true, then there exists a discourse entity describable as
"the A which Ps which was mentioned (or evoked) by the proposition." (Here
$\iota$ stands for Russell's definite operator, iota.) Since a unique
description can be ascribed to this discourse entity, it can be referred to
with a definite anaphor.

There are many places in the Babar story where the application of this
rule, RW-1, accounts for both the existence of a new discourse entity and
an appropriate unique description for it. A particularly straightforward
example is the first sentence of page 20 of the story, which we shall label
20.1 for convenience.

    20.1 The boat is going toward a bridge.

As a first approximation (that is, finessing the semantics of "going
toward"), sentence 20.1 can be represented as

    $(\exists x : \text{Bridge}) \,.\, \text{Going-toward } b_1, x$

where $b_1$ is a unique label for the discourse entity referred to
anaphorically with the definite description "the boat." Since this matches
the left-hand side of rule RW-1, it follows that

    $(\exists x) \,.\, x = \iota z : \text{Bridge } z \,\&\, \text{Going-toward } b_1, z \,\&\, \text{evoke } S_{20.1}, z$

i.e., there exists an individual discourse entity uniquely describable as
"the bridge which was mentioned in sentence 20.1 which the boat was going
toward." This is the discourse entity referred to anaphorically via the
definite description "the bridge" in the very next sentence.

    20.2 Suddenly Zephir shouts, "Babar, look! Up on the bridge."

In our preliminary analysis of the two texts BABAR and INDIAN, our method
of analysis was as follows: we went through the texts sentence by sentence,
considering each noun phrase in turn. To each indefinite referential noun
phrase we applied the appropriate rule given in Nash-Webber (1978a), extended
intuitively as mentioned earlier. We assigned the discourse entity that was
evoked a label (e.g., $e_1$, $e_2$, etc.) and its unique description. For
each definite referential noun phrase, we considered whether it matched the
description of an existing discourse entity and was therefore anaphoric. If
it was not, we created a new discourse entity, labelled it, assigned it its
given definite description, and considered, a propos of question 1, on what
basis that unique description was justified.

For example, consider again sentence 20.1.

    20.1 The boat is going toward a bridge.

On the preceding page, the listener/reader has been told

    18.2 A boat is ready to take off.
    18.3 "Captain, wait for us!" shout the children.
    18.4 They climb aboard.

The indefinite noun phrase in sentence 18.2 should have evoked a discourse
entity in the listener/reader's model uniquely describable as "the boat
mentioned in sentence 18.2 that was ready to take off" (labelled here
$b_1$). As for the definite description "the boat" in sentence 20.1, we find
that it matches that discourse entity description (and moreover, no other
one). We therefore take it as an anaphoric reference to that discourse
entity under the assumption that the listener/reader understands sentence
18.2 in accordance with a rule like RW-1. As for the indefinite noun phrase
"a bridge," we create a new discourse entity and assign it the unique
description "the bridge mentioned in sentence 20.1 that the boat $b_1$ is
going towards," again in accordance with RW-1.
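
The bookkeeping in this worked example can be sketched in code. The Python
fragment below is only an illustration under stated assumptions: the names
`Entity`, `DiscourseModel`, `evoke`, and `resolve_definite` are hypothetical,
matching on an identical head noun stands in for the much richer description
matching that was done by hand, and the sentence-level rules of Nash-Webber
(1978a) are not implemented.

```python
from dataclasses import dataclass, field


@dataclass
class Entity:
    label: str          # arbitrary label such as "e1"
    head: str           # head noun of the noun phrase that evoked the entity
    description: str    # unique description assigned at introduction


@dataclass
class DiscourseModel:
    entities: list = field(default_factory=list)

    def evoke(self, head, sentence_id, predicate):
        """Wide-scope indefinite NP: create a new entity (in the spirit of RW-1)."""
        label = f"e{len(self.entities) + 1}"
        description = f"the {head} mentioned in sentence {sentence_id} {predicate}"
        entity = Entity(label, head, description)
        self.entities.append(entity)
        return entity

    def resolve_definite(self, head):
        """Definite NP: return the single matching entity, otherwise None.

        Assumption: two descriptions match when their head nouns are identical.
        """
        matches = [e for e in self.entities if e.head == head]
        return matches[0] if len(matches) == 1 else None


model = DiscourseModel()
boat = model.evoke("boat", "18.2", "that was ready to take off")
# "the boat" in 20.1 matches exactly one existing entity, so it is anaphoric.
referent = model.resolve_definite("boat")
# "a bridge" in 20.1 is indefinite, so it evokes a new entity.
bridge = model.evoke("bridge", "20.1", f"that the boat {boat.label} is going toward")
# A definite NP with no matching entity must be justified on some other basis.
unanchored = model.resolve_definite("taxi")
```

A `None` result from `resolve_definite` corresponds to the case in which a
new entity is created and the analyst asks on what basis the definite
description is justified.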

There were two sets of data resulting from this preliminary analysis:
1) an annotated text, with each referential noun phrase tagged with the label
of the discourse entity it either evoked or referred to (useful for
investigating questions about the rate at which discourse entities are
introduced, the lag between introduction and later anaphoric reference,
etc.), and 2) a list of discourse entities (actually their labels), each
with its unique description (i.e., in accordance with its initial
introduction) and any additional information about it garnered from
subsequent anaphoric references (useful for investigating questions
concerning the separability of characters and the justifiability of other
definite descriptions).
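
As a rough illustration of how the first data set could be put to use, the
following Python sketch derives points of introduction and lags between
mentions from an annotated text. The function name, the input format (pairs
of sentence index and entity label), and the sample tags are all
assumptions made for the example; they are not taken from the analysis
itself.

```python
def introduction_and_lag(annotated):
    """Summarize an annotated text.

    annotated: list of (sentence_index, entity_label) pairs in text order.
    Returns (first_mention, lags): the sentence index at which each label
    is introduced, and for each label the gaps (in sentences) between
    successive mentions.
    """
    first_mention = {}
    last_mention = {}
    lags = {}
    for index, label in annotated:
        if label not in first_mention:
            first_mention[label] = index
            lags[label] = []
        else:
            lags[label].append(index - last_mention[label])
        last_mention[label] = index
    return first_mention, lags


# Hypothetical tags: one entity is mentioned again soon, another much later.
tags = [(1, "family"), (4, "arthur"), (5, "family"), (57, "arthur")]
first, lags = introduction_and_lag(tags)
```

A long lag for an entity that was introduced alongside several others is
the kind of pattern the questions about separability and lag are concerned
with.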

We will now show how this kind of reference analysis allows us to make
some interesting comments about a text that may be involved in its
readability. In particular, we will focus on the justifiability of definite
descriptions (question 1 above). As stated, the rule RW-1 given earlier is
only applicable when an existential noun phrase is understood to have wide
scope over a sentence. (To put this more simply, though less accurately, for
the listener/reader to understand an existential as having wide scope is for
him or her to assume that the speaker/writer has some particular x in mind
which s/he nevertheless cannot refer to as "the x," since it is not unique.)
With this in mind, it is clear that the rule RW-1 is not applicable to the
second sentence on page 6 of the Babar story:

    6.2 The Babar family is waiting for a taxi.

This does not mean that they are waiting for some particular taxi, but rather
that they are waiting for any one that comes along. In other words, sentence
6.2 should not evoke a discourse entity uniquely describable as "the taxi
mentioned in sentence 6.2 which the Babar family is waiting for." But now
consider the very next sentence of the text.

    8.1 The taxi takes them to their hotel.

How might the writer justify this definite reference to "the taxi"? It is
not the particular taxi the family is waiting for, since it is not the case
that they are waiting for a particular one. However, the author may be
assuming that the listener/reader will infer that if the family is waiting
for a taxi, then eventually it will be the case that a taxi picks them up.
This latter sentence, "a taxi picks them up," does imply the existence of
a discourse entity uniquely describable as "the 'just-mentioned' taxi which
picked them (= the Babar family) up," which is presumably the referent of the
definite anaphor "the taxi." Thus, the use of this definite description is
motivated by the writer's assumption that the listener/reader both can and
will make this plausible, world-knowledge-based inference about the eventual
appearance of a taxi. Such inference may demand a high level of
sophistication and familiarity with the real world and thus strain the
listener/reader's ability to follow the text.

In the "Indian Occupation" passage, there are many examples of definite
descriptions which make heavy demands on the listener/reader's knowledge of
particular factual events in order to justify and understand them. This may
be one reason why the text is so hard to follow. For example, the passage
begins with three definite descriptions (here underscored).

    _The Indians_ had not heard from _the government_.
    _The suit for Alcatraz_ was still not settled.

Consider first the phrase "the Indians." It is obviously not anaphoric,
since it is the first noun phrase in the passage. Therefore, we create a
new discourse entity and try to justify the unique description "the Indians."
We find that we cannot. Consider next the phrase "the government." It too
is not anaphoric, since it does not match in any way the description of the
only discourse entity currently around, the one describable as "the Indians."
Again we create a new discourse entity and try to justify the unique
description. Again we cannot: there is neither a unique government that we
as readers know about nor a unique government that we associate with "the
Indians," whoever they are. The same holds true for the definite description
"the suit for Alcatraz." In fact, only if the listener/reader already
knows about the Indian occupation of Alcatraz will there be some
justification for the uniqueness demanded by these definite descriptions.
Even then, it will be an a posteriori justification, since this context is
not imparted until the second sentence. The passage is filled with such
"unanchored" definites, and it is a benefit of this type of reference
analysis that it can help to point out places where world knowledge and
inferences based on it are demanded for understanding.

In the remainder of this section, we shall comment on our preliminary
reference analysis of Babar Loses His Crown vis-a-vis the remaining four
questions posed at the beginning of the section. Questions 2 and 3 concerned
the separability of discourse entities and the rate at which they are
introduced. One case of low separability and high rate of introduction
involves the Babar family themselves, who are introduced on the first page
of the text as follows:

    The Babar family is going to Paris.
    Everyone is packing bags.
    Here are the children--Pom, Flora and Alexander. Here are
    Cousin Arthur and his friend Zephir, the monkey.

Nothing here permits the reader/listener to distinguish Pom, Flora,
Alexander, and Cousin Arthur except their names. Moreover, nothing further
is said about any of these four characters individually until page 23:

    The boat does not stop.
    So Arthur dives off.

In preparing our preliminary analysis, we found that we had no recollection
at this point of who Arthur was, although the use of the name with no
further attributes implied he had been introduced earlier in the story. We
attribute his evanescence to the number of characters introduced at the same
time as he and the lack of distinguishing properties attributed to each one.
This is also a comment on the fourth question concerning the lag between a
discourse entity's introduction and a subsequent reference to it and the
confusions it leads to. Of course, if we had attended to the illustrations
there would have been no problem here; all the elephants are pictured in
almost every illustration, and the one on page 23 shows "Arthur" diving off
"the" boat.

The fifth question concerned focus and the use of pronominal reference,
in particular, the case where there is more than one character in focus that
is accessible via the same pronoun. Such a case occurs in the Babar story,
in a sequence involving Arthur and the man presumed to have taken Babar's
crown bag.

    24.1 Arthur climbs out of the water just as fast as he can.
    24.2 He is all wet.
    24.3 He sees the man with the bag.
    25.1 Arthur calls to him.
    25.2 But the man does not hear him.
    25.3 He is walking away.
    26.1 Arthur runs after him.
    26.2 He waves.

In sentence 24.3, the pronoun "he" is used to refer to Arthur and the
definite description is used to refer to the man. In the next sentence
(25.1), things switch, and the pronoun "him" is used to refer to the man and
the proper name to refer to Arthur. In the next sentence (25.2), things
switch again, with the pronoun now referring to Arthur, and the definite
description to the man. In the next sentence (25.3), "he" presumably refers
to the man, and two sentences later (26.2), "he" presumably refers to Arthur.
The use of the definite description and proper name serves to keep both
characters in focus. However, keeping them equally in focus also keeps them
equally possible referents for the definite pronoun "he," if context is not
sufficient to favor one over the other. The crucial question is whether
context is indeed a sufficient cue to the intended referent of "he" in the
two sentences "He is walking away" and "He waves." Further research is
obviously necessary, both to refine our methods of reference analysis and to
identify some measures of readability associated with reference. One current
intuition is that stories beginning "Once upon a time there (was, lived,
stood, etc.) ..." may really be easier to understand, at least initially,
than ones which do not.

2) Other aspects of anaphora. In this section we take up in more detail
two of the questions posed at the beginning of the previous section, and
introduce an additional aspect of discourse reference which might be
expected to be a source of difficulty in comprehension. We comment first on
the subtleties of beginning to solve question 4, how much difference
distance between a pronoun and its referent makes. Then we consider certain
intricacies of the resolution problem. Finally, we take up the problem of
implied referents.

As mentioned in the preceding analysis section, anaphoric reference by
pronouns may lead to comprehension difficulty if the pronoun is too far
removed in time or topic from its original referent, as in this made-up
discourse:

    John is one of my best friends. And his sister Suzanne is one of the
    best tennis players in the country. In fact, she might even make the
    International Circuit this year if her luck only holds out. She just
    has to win one more tournament in California and then she's a cinch
    to be invited. Anyway, as I was saying, he is one of my best
    friends. ...

Here, the pronoun "he" in the final sentence is not very comprehensible
after all the information about a different topic, namely Suzanne, has been
presented. Notice that in no way can the pronoun reference be said to be
ambiguous: John is the only possible referent. Still, the time and topic
shift may cause comprehension problems.

In a text like BABAR, we find many, many pronoun references. For the
most part, these references follow closely in time the lexical noun phrases
that they refer to, as in this passage:

    (58-67) Arthur climbs out of the water just as fast as he can. He
    is all wet. He sees the man with the bag. Arthur calls to him.
    He is walking away. Arthur runs after him. He waves. He yells,
    "Come back, Mister. You have Babar's crown!"

With an understanding of the situation, there is little if any difficulty
interpreting the referents of the "he"s and "him"s.

But there is one class of pronoun reference that one would suspect, on
the basis of distance between references, would cause comprehension
difficulties, but which does not in fact do so. This has to do with the
references of the pronoun "they." In this example, which stretches over five
pages,

    (128-134) (##48) They all follow him, shouting, "Stop, please,
    Mr. Mustache!" (##50) Too late! Stuck again! The gates at the
    bottom of the stairs snap shut. (##51) "Bring back my crown!"
    shouts Babar. But the man gets on a train, and the train goes
    away. (##52) Sadly, they come up from the subway.

there is quite a bit of intervening material between the first and second
"they"s. We hypothesize that the reason the pronoun "they" seems so
immediately comprehensible is that "the Babar family" is highly topical, in
that the concept of the Babar family is central to the story and always kept
in mind. Thus, almost any reference to the family in any way, even with a
pronoun, may be expected to be easily understood.

We take up now the subcase of the resolution problem that we may call
contextual redefinition. As an example, let us examine the use of the phrase
"the children" in BABAR. In this passage from page 2, we see the phrase
referring specifically to three individuals, Pom, Flora, and Alexander.

    (3-4) Here are the children--Pom, Flora and Alexander. Here
    are Cousin Arthur and his friend Zephir, the monkey.

But it has a different reference in the following passage:

    (117-125) They get out in front of a market. "I guess we'll have
    to forget about the crown," sighs Babar. So the children begin to
    run and play. They race around. They hide behind boxes. Then they
    see another man with a small red bag. All the children rush after
    him. Arthur knocks over a box of apples. Zephir knocks over a box
    of fish.

Here "children," at least by the second occurrence of the word, seems to
refer to all five of the younger protagonists. This point may be argued
here, but not in another example,

    (138-141) Celeste says, "We'll put the children to bed in the hotel.
    Then we'll leave them and go to the opera." Back in their hotel
    room, they say goodnight to the children. The three littlest ones
    are already fast asleep.

Now exactly who is tired or who has been running and playing in the market
is not terribly crucial to the story here. Nevertheless, the contextual
redefinition of particular lexical items is seen in these examples, and use
of the same lexical phrase for different references is a possibility in
general and can lead to ambiguities or confusions of reference. The same
problem of contextual redefinition of reference is, of course, always present
for pronouns, usually to an even greater extent than for lexical noun phrases.

Finally, we take up the problem of anaphoric reference to implied
referents. This is exemplified by instances where there is an anaphoric
connection to some object that is not overtly given in the text, but rather
must be inferred by the reader, as in this passage from BABAR:

    (11-13) Now they are off the train, with all their bags. The
    Babar family is waiting for a taxi. The taxi takes them to
    their hotel.

Notice that the Babar family is not waiting for any specific taxi. Rather,
they are waiting for taxi transportation. In the sentence "The taxi takes
them to their hotel," the taxi referred to must be some taxi which finally
picked them up, and the phrase "the taxi" must be short for "the taxi that
picked up the family." No prior reference is made to this taxi and no
mention is made overtly in the story that there was a taxi that picked up the
Babar family. The reference to "the taxi" is, then, a reference to an implied
referent.

Although the question of the proper logical and psychological analysis
of implied reference is a difficult one, with frame analysis offering one
promising possibility, our suspicion is that references to implied referents
will not typically create comprehension difficulties, except possibly where
knowledge of the frame referred to is imperfect.

C. Text-Level Inferences

In this section we discuss the analysis of inferences that have to be
made in comprehension of text which is intended as connected discourse. Our
analyses are not intended to provide an exhaustive listing of all possible
inferences that could or must be made from the use of individual sentences.
Rather, we have concentrated on enumerating inferences that must be made in
order to understand the (intended) relation between (or among) the sentences
of a text; in other words, inferences that the author must have implicitly
intended to be made, as opposed to ones which are merely possible, given the
meanings of the lexical items and the rules of syntax.

As an illustration, let us consider a couple of cases. The book Babar
Loses His Crown begins with the following sentence:

    1. The Babar family is going to Paris.

Among the inferences that it is possible to make from this sentence are the
following:

    1a. There is a Babar family.
    1b. There are at least two individuals related by blood or marriage
        who are going to Paris.
    1c. Babar is their last name.
    1d. Paris is a place. (It could be a city, a country, a planet, a
        neighborhood, a restaurant, or a department store, inter alia.)
    1e. They are not in/at/on Paris now.

But inference (1b) follows as an automatic consequence of understanding the
meaning of the word "family," (1d) and (1e) follow from the meanings of
"going" and "to," and (1a) follows from linguistic conventions on the use of
the word "the." Inference (1c), which happens to be incorrect (Babar is the
father's first name), follows from culture-specific linguistic conventions
about referring to families. But what all of these inferences have in common
is that they are inferences that can or must be made of the sentence in
isolation: the same inferences could be expected to be made if the sentence
had been uttered as the initiation of a conversation. In other words, these
inferences are independent of the use of the sentence in a connected piece of
discourse. Since ability to make these inferences follows automatically from
knowledge of the language plus the ability to reason, they do not have the
direct kind of bearing on reading competence that inferences about the
relations among sentences have.

To contrast these types of inferences, let us examine another case.
These two sentences are also from BABAR.

    93. But Babar can't eat.
    94. He is thinking about his crown.

The following inferences may be made about the relation between sentences
(93) and (94):

    94a. Babar is thinking about his crown.
    94b. The reason Babar can't eat is that he is thinking about his
         crown.
    94c. Thinking about his crown makes it impossible for Babar to eat.

Inference (94a) is not of so much interest here, as it is an inference of the
reference of the pronoun "he," and depends on knowledge of conditions on the
use of pronouns discussed in Section IV B 3 of this report. Inference (94b)
is presumably the one the author intended the reader to make: he presumably
intended sentence (94) as an explanation for the state reported in sentence
(93). What about inference (94c)? While (94c) might be taken as being a
necessary premise, along with (94a), for concluding (94b), (94b) follows not
from (94a) and (94c) taken as premises, but from the fact that the author has
provided (93) and (94) in that order, and from the assumption that he is
adhering to a principle of cooperative discourse which provides that unless
warning is given to the contrary, a sentence will be assumed to be relevant
to the point of previous discourse (cf. Grice, 1975). From this fact and
this assumption, the reader may infer that the relevance of (94) to (93) is
that (94) is intended as an explanation for (93). Note that this is not a
necessary inference like (94a); the reader might be wrong about the author;
(94) might not have been intended to be relevant to (93). Or (94) might
have been intended to be relevant only insofar as it described a state of
Babar simultaneous with (93). We conclude, however, that (94b) is an
inference that should be made for this text (BABAR) to be understood; that
is, the author probably did intend the two sentences (93) and (94) to be
taken as being related by (94b).

Thus our enumeration of text-level inferences concentrates on inferences
relating sentences to each other. The enumerations are probably not complete,
however; indeterminacy and dependence on the intentions of an unavailable
individual, the author, of the sort just described prevent this from being a
feasible goal. We have, nonetheless, attempted to provide accounts of the
inferences that must be made in order to understand the sequences of
sentences as coherent texts representing the product of some purposeful
activity.

Illustration and Layout Analysis 

The importance of the role of linguistic properties of written materials
for readability is obvious. Less obvious, but possibly quite important, is
the role of non-linguistic aspects of the text, ranging from the size and
color of the pages and choice of type face to relations between the language
of the text and its visual properties. In this section we explore two
problems that belong in this domain: illustrations and their relation to the
text, and the relation between line-layout and certain syntactic and semantic
properties.

In the first section, we present a discussion of the analysis of
illustrations and the relations of their content to the understanding of the
text. In the second section we present a discussion of the relation of
line-ends to syntactic structure and to the structure of direct quotation.

Illustrations

1. Relevance of Illustrations to Comprehension

Illustrations in children's books are not merely decorative enticements
to read. In scientific and social studies texts, and even, perhaps
surprisingly, in many narratives, they represent an integral part of the
content. It is typically the case with preprimers and basal readers for the
first few grades that the texts are incomprehensible without reference to the
accompanying illustrations, and even when the texts are self-contained,
illustrations may be sufficiently redundant that a child could become
proficient enough at interpreting illustrations to lose interest in improving
his ability to decode written language. For this reason, we consider the
relation between a text and accompanying illustrations to be an important
object of analysis.

Section 2 describes the procedures used in analysis of the redundancy
of illustrations to text, and some issues that will have to be resolved or
accommodated before meaningful comparisons of texts can be attempted.

2. Discussion of the Calculation of the Portion of a Narrative Retrievable
from Accompanying Illustrations

a. Introduction

Two investigators calculated the proportion of the text retrievable in
the illustrated narrative (Babar Loses His Crown) according to the procedure
specified vaguely as follows:

The Procedure

    1. For each illustration, provide content statements which tell
    the story told by the accompanying text. In other words, translate
    the text into a sequence of statements, which recounts the story
    in the same order as the text, but with no direct discourse (except
    where essential to the plot), and no rhetorical devices. Statements
    should describe one action or setting each.

    2. For each statement, indicate what portion of the content of
    that statement is retrievable from the accompanying illustration.

    3. Average the percent retrievable for all statements to obtain a
    measure of the amount of the text retrievable from the accompanying
    illustrations.
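
Step 3 of the procedure is simple arithmetic, and a small sketch makes plain
that all of the judgment lies in steps 1 and 2. The Python function below is
an illustration only: the function name and the per-statement ratings are
hypothetical, and it assumes each statement receives a single percentage
between 0 and 100 with all statements weighted equally.

```python
def percent_retrievable(ratings):
    """Step 3: average the per-statement retrievability judgments.

    ratings: one number per content statement, each between 0 and 100,
    giving the judged percentage of that statement's content that can be
    recovered from the accompanying illustration (step 2).
    """
    if not ratings:
        raise ValueError("at least one content statement is required")
    for rating in ratings:
        if not 0 <= rating <= 100:
            raise ValueError(f"rating out of range: {rating}")
    return sum(ratings) / len(ratings)


# Hypothetical judgments by two raters for the same four statements.
rater_a = [100, 50, 0, 50]
rater_b = [50, 50, 0, 0]
score_a = percent_retrievable(rater_a)
score_b = percent_retrievable(rater_b)
gap = abs(score_a - score_b)
```

Because the average is taken over statements, two raters who segment the
text into statements differently (step 1) or who judge individual statements
differently (step 2) can arrive at quite different totals.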

The vagueness of this specification of the procedure is attested by the
fact that the initial assessments by the investigators differed by more than
30%: the research assistant calculated that the illustrations told 62% of
the story; the principal investigator calculated that they told roughly 40%.
After discussing the rationale for the assessments with each other,
correcting inconsistencies and over- and under-evaluations that could be
agreed upon, the assessments were 52% and [illegible], respectively.

b. Problems of Analysis \ 

Several types of problems arose in the attempt to assess the amount of
a narrative story that was retrievable from accompanying illustrations, and
ultimately to arrive at a specifiable procedure for making this assessment.
We describe these in some detail.

1) The translation of text into content statements. The first problem
that arises, both logically and procedurally, is how to translate the text
into content statements. It had been agreed that the content statement should
be a natural narrative unit, but there were no criteria for determining how
much text a statement could encompass, or for constraining the form of
statements beyond the rule of thumb that the number of statements would
generally correspond to the number of finite verbs in the text, though they
need not always, as in the following examples from other texts:

    1. One statement, more than one finite verb

       Text: Debbie had a dream. It was a
       bad dream. It was about a tiger.

       Statement: Debbie had a bad dream about a tiger.

    2. One finite verb, more than one statement

       Text: Mog was nice--but not very clever.

       Statements: Mog was nice. But Mog was not very clever.

Direct discourse was a problem in its own right. Our attempt was to
attend to the perlocutionary effects of quoted speech as they would appear
in a recounting of the narrative, thus:

    3. Direct discourse

       Text: "Pardon me," said the fiddler to the milkman.
       "I am a stranger in town. Perhaps you could direct
       me to a place where I might have breakfast."

       Statements: The fiddler introduced himself to the
       milkman. He asked where he could have breakfast.

Notice that statements cannot usefully be held to "propositions" in the
logician's usage. It does not help to decompose statements like "He asked
where he could have breakfast" into component propositions. Likewise, if
one follows the Generative Semantics program of semantic analysis, every
predicative element (including quantifiers, conjunctions, negatives, and
adverbs, as well as attributive adjectives and nouns) is the nucleus of some
semantic proposition, but it appears that an analysis at this level would
fragment information too much to be useful for our purposes.

Given the limitations that had been established, however vaguely, and
the shared exemplars, it did not appear that BABAR would present too many
problems of translation into statements, as there are hardly any sentences
with more than one surface clause (13 out of 191, counting infinitive phrases
as clauses). However, even so syntactically impoverished a text as BABAR
turned out to present problems.

To begin with the simplest, consider the case of modifiers that are not
syntactically obligatory, e.g., the underlined phrases in sentences 11 and
17:

11. Now they are off the train _with all their bags_.

17. _In their hotel room_, Celeste opens all their bags.

One investigator treated "with all their bags" as a separate statement ("all
their bags were with them"), as it turns out to be important to the story,
and "in their hotel room" as part of a statement, as it was unimportant to
the narrative where the unpacking took place. The other investigator treated
"with all their bags" as part of a statement, apparently because a retelling of
the narrative with that content as a separate sentence accords more
importance to this information than the original text does. This investigator, on
the other hand, treated "in their hotel room" as a separate statement ("[they
go] to their hotel room"; bracketed material refers to content that must be
inferred) since this could be recounted separately without affecting the
development of the plot or sounding bizarre. The investigators were unable to
agree on which criteria were more valid, as each has both advantages and
disadvantages as summarized below.

a. Importance of the content to the narrative: if it is important,
count it as a separate statement. This seems plausible, but
there are two hitches. First, it requires foreknowledge (which a
first-time reader does not have): you have to know what happens
later in order to know what is important at the beginning. On the
other hand, if comprehension works in such a way that the reader's
model of what the important things are is continually being revised,
then he has hindsight by the time he is finished, which accomplishes
the same thing. Second, making importance to the story critical
would result in a content analysis where some predicative elements
are the nuclei of content statements but many others have their
(surface) nominal or attributive roles; some phrases in text sentences
would be semantically decomposed into several statements
while others would be parts of undecomposed statements.

b. Coherence of the sequence of content statements as a text:
do not make it a separate statement if it would make the sequence
of content statements an incoherent text. This "transderivational"
criterion may turn out to be unempirical, but its purpose was to
provide for an analysis which looked like it had a chance of
corresponding to a skeleton narrative from which the story could be
elaborated, or reconstructed, combining statements in complex
sentences.

c. Rhetorical effect of decomposition: do not make it a separate
statement if it results in emphases different from those in the
original text. Again a "transderivational" criterion, this has
less likelihood of being useful, as almost any meddling with a
text is bound to have some rhetorical effect.

2) Totally irrelevant material. Some textual material is totally
irrelevant to the story being told, for example, "Last of all" in sentence 18.

18. Last of all, she opens the little red one.

The question arises whether this would be considered part of the content of
the narrative. While this issue makes a difference here, since it affects
the proportion of content retrievable from illustrations, its relevance is
not limited to investigation of inferences from illustrations, but will
recur wherever it is attempted to objectify, quantify, or otherwise analyze
content. Some books have more irrelevant content than others. The underlined
phrases in the following passages from Howard Garis' Uncle Wiggly and
His Friends (1939, Platt & Munk) were all judged to be entirely irrelevant
to the stories they occur in.

One day Uncle Wiggly Longears started out for a ride in his
automobile. It had a turnip steering wheel that he could
nibble on when he was hungry. . . . Pretty soon he came to
a place where there was a little shop, made from corncobs.

Then the muskrat lady, who was also camping on the island,
began to cook breakfast. The kind circus elephant [never
before mentioned, note] got a pail of water in his trunk, at
the well Uncle Wiggly had dug.

In fact, the whole breakfast episode in the second passage is entirely
irrelevant to the story it occurs in.

3) Proper names. A third problem, which _is_ limited to research on
illustrations, is how to evaluate text which includes the names of
individuals who figure in the narrative, particularly the material which introduces
these individuals. That is, when you have a text like sentences 3 and 4
from BABAR, and an illustration showing three smallish elephants (two dressed
in shorts, one in a dress), one larger elephant dressed in a (French) sailor
suit, and a monkey, in shorts and a beret, how should the information content
of the illustration with respect to this text be counted?

3. Here are the children--Pom, Flora and Alexander.

4. Here are Cousin Arthur and his friend Zephir, the monkey.

All that a "picture-reader" can tell at this point is that some juvenile
elephants and a monkey are involved (along with two adult elephants) in the
story. This was counted as equivalent to about half of the content of
sentence 3 and one-quarter of sentence 4. The underlined portion of the
following statements recounting sentences 3 and 4 was considered to be
represented in the illustrations.

There are some (elephant) children. They are Pom, Flora, and
Alexander. There is a cousin and a monkey. The cousin's name
is Arthur. The monkey is Cousin Arthur's friend. His name is
Zephir.

or

There are (elephant) children--Pom, Flora and Alexander. There
is a cousin (Arthur) and his friend Zephir, the monkey.
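
The arithmetic behind "about half" and "one-quarter" can be made explicit. What follows is only a minimal sketch, not the investigators' actual procedure. It assumes that each text sentence has been recounted as a list of content statements (the first recounting above), and that each statement has been flagged by hand as represented in the illustration or not. The flags used here are an assumption inferred from the discussion, and the names `Statement` and `retrievable_fraction` are hypothetical.

```python
from dataclasses import dataclass
from fractions import Fraction


@dataclass
class Statement:
    text: str
    in_illustration: bool  # hand-coded judgment; assumed here for illustration


def retrievable_fraction(statements):
    """Share of a sentence's content statements judged retrievable from the picture."""
    if not statements:
        raise ValueError("a sentence needs at least one content statement")
    shown = sum(1 for s in statements if s.in_illustration)
    return Fraction(shown, len(statements))


# BABAR sentence 3, recounted as two statements (flags are assumptions).
sentence_3 = [
    Statement("There are some (elephant) children.", True),
    Statement("They are Pom, Flora, and Alexander.", False),
]

# BABAR sentence 4, recounted as four statements (flags are assumptions).
sentence_4 = [
    Statement("There is a cousin and a monkey.", True),
    Statement("The cousin's name is Arthur.", False),
    Statement("The monkey is Cousin Arthur's friend.", False),
    Statement("His name is Zephir.", False),
]

print(retrievable_fraction(sentence_3))  # 1/2
print(retrievable_fraction(sentence_4))  # 1/4
```

Counting every statement equally is itself a simplification; the second recounting above groups the same content into fewer statements and would yield different fractions.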

It must be noted that even a text-reader cannot tell which elephant has
which name, not even which is Cousin Arthur, the monkey's friend. Later, if
the reader pays close attention to the pictures, he or she will be able to
infer which names go with certain elephants, but even after hundreds of
readings, it is possible to identify with certainty only Flora and Arthur
without accompanying text, in addition to the obvious Babar and Celeste. In
any case, the names of individuals are something a picture-reader would never
be able to infer from illustrations. What does this imply for the evaluation
of the retrievability of later text which predicates various actions of these
individuals, referring to them by name? Not much, it was concluded, since it
seemed most reasonable to assume that as far as the story retrievable from
the text was concerned, picture-readers would distinguish the characters by
means of identifying expressions which referred to the illustrations, on the
order of "the elephant with the sailor suit" or "one of the little-boy
elephants," and they would be able to tell an equivalent story about the same
individuals. They would just refer to them differently from the way the text
does.

4) Text which requires inference for interpretation. A number of the
difficulties which arose involved inferences from text or illustrations (or
perhaps, speaking more correctly, chains of inferences, since interpretation
of straightforward text and simple illustration always involves making some
inferences). To begin with, there are cases in which information must be
inferred from the text in making the translation to content statement. But
it is not clear how much is inference and how much is just understanding
sentence meaning. For example, consider the illustration on pages 10-11.

Celeste (profile view) is seated in front of an open red
suitcase, grasping a flute in her trunk. Her eyebrow (not
shown in other illustrations; maybe it is a forehead wrinkle)
is raised (in surprise?). The other five elephants stand
opposite, their trunks extended (in interest?), Babar's at
nearly 90 degrees.

The accompanying text includes sentences 19 through 26.

19. "Look!" she cries.

20. "What is this? 21. A flute!

22. Babar! 23. This is not your bag."

24. "My crown. 25. It's lost!" cries Babar.

26. "My crown is gone!"

The point of sentences 19-21 is that Celeste has discovered a flute (in the
red bag), and that of sentences 24-26, that Babar is very upset because his
crown is not there. In principle, much more of the inferable content can
be retrieved from suitable illustrations than from such fragmentary
exclamatory direct discourse as is found here. In this case, however, only the
existence of the flute was counted as retrievable (and not its discovery, as
Celeste's eyebrow is neither very salient nor unambiguous). Babar's dismay
could have been pictured, but it was judged not to be.

5) Knowledge of the world required for the fullest interpretation of
illustration. A thornier problem involves cases where there is a reasonable
question as to whether a picture-reader can be expected to be able to make
certain inferences from an illustration. The answer to this question depends
on how much real-world knowledge can be imputed to the picture-reader,
assuming that he has mastered the complex strategies for interpretation of
abstract representations of real-world objects. (Informal observations
suggest that 15-month-old infants can do this.) In BABAR, this problem arose
with reference to the taxicab and hotel scenes. For instance, on pages 6-7
there is the following sentence:

12. The Babar family is waiting for a taxi.

The illustration shows the Babar family standing next to a group of humans.
At the extreme left is the front one-third of an automobile with a "knob"
on top which has the word TAXI written on it. Is it plausible to expect a
picture-reader (who cannot read TAXI) to recognize that this auto is a taxi,
and infer that the Babar family is waiting for it? [It is not clear that
even that inference is invited; they could be waiting for the light to
change. It may be only our knowledge that frequently when one takes a train
trip (cf. pages 4-5), one takes a taxi to a more "specific" destination that
makes this inference not unreasonable. But this is treated elsewhere.] The
point is, what does an illiterate have to know about taxis to recognize this
object as a taxi? Is the half-inch by three-eighths-inch "knob" with letters
on it enough? Does the short-billed hat on the head of the driver help
enough? Or is it part of the meaning of taxi that taxis have "knobs" on top?

The problems in the next scene are probably more evident. Sentences 13
and 14 read:

13. The taxi takes them to their hotel.

14. Celeste and the children walk inside.

Disregarding the taxi (part of the right front fender and grill are shown),
the illustration shows Celeste entering a stone or concrete edifice through
one of two doors in arched doorways festooned with Neo-Gothic or Baroque or
Roman heads and iron grillwork. Part of a wrought-iron balcony is shown
above the doors. In other words, it is a stereotypical European hotel. Can
a picture-reader be expected to recognize it as a hotel? Even if he or she
knows what a hotel is, it does not bear much resemblance to an American hotel
(outside of New York, anyway).

6) Illustrations that are fully interpretable only by inferences
relating them to preceding illustrations. The final identified problems
involve illustrations that cannot be interpreted without reference to previous
illustrations. In the illustrations just described, if one infers that the
1/25 of an auto shown on page 8 is the same as the 1/3 of a taxi shown on
page 6, then it is reasonable to infer that it brought the elephants to the
location shown on pages 8-9. But if one does not connect the illustrations
like this, it is just 1/25 of an auto. Continuing, pages 10-11 show the
elephants, a chair, a hassock, and an open red suitcase. If the reader
interprets pages 8-9 as saying that the elephants enter a hotel, and connects
it to this illustration, he may be able to infer the content of the initial
adverbial phrase in sentence 17.

17. In their hotel room, Celeste opens all the bags.

But without pages 8-9, the illustration on pages 10-11 says nothing directly
about the location of the scene. The chair, hassock, and open suitcase
invite an inference of a room, perhaps even a room where they are scheduled to
spend the night, but no background is shown at all: no walls, no windows.

Pages 48-51 exemplify the opposite situation: an illustration that can
be interpreted correctly (with respect to the accompanying text) only by
taking into consideration a subsequent illustration. On the first of these
we have an illustration of the Babar family running after a man who is
running down some stairs which are flanked by a sign which says METRO,
accompanied by the text:

126. Now the whole Babar family chases the man with the bag.

127. Down the stairs of the subway.

128. They all follow him, shouting, "Stop, please, Mr. Mustache!"

Nothing in this illustration (and precious little in the text, taken at its
face value) indicates that the Babar family descend the stairs. The
illustration on pages 50-51 shows the heads of Babar, Arthur, Zephir, and Celeste
peeping over a gate which encloses them in a sort of tunnel which is hung
with signs reading CORRESPONDANCE and SORTIE. The man (notice that this is
a reference to the previous illustration) is shown stepping through a sliding
door (more specific real-world knowledge); an adjacent window shows a seated
couple; presumably this is a subway train (more real-world subcultural
knowledge). The point is, if one can infer that this is a subway, can one infer,
with respect to the previous page, that the Babar family did follow the man
down the stairs, which were stairs to the subway? Making the inference is
not made any easier by the fact that no stairs are shown in the illustration
on pages 50-51.

7) Illustrated content that has no apparent relevance to the narrative
until a number of frames after its occurrence. The most difficult problem
involves information that is present and clear, but whose relevance does not
become clear to the picture-reader until many pages later, if at all. Take
the case of the flute and the Mustache-man (the crux of the story): Page 10
shows Celeste holding up a flute with a look of mild surprise in her eye. No
flute recurs until page 60, where Babar is shown holding one/it, with a
smile on his face, confronting the/a mustachioed man, who is smiling and
holding a crown. Pages 62-63 show Babar and Celeste in a box at a Wagnerian
opera. The orchestra pit is shown, and the flute-player has a mustache.
The flute is about one and one-quarter inches by one-sixteenth of an inch;
the mustache three-sixteenths of an inch by one-sixteenth, so neither is
very salient. Would or could a picture-reader, on encountering the
illustrations on pages 60-63, remember back to page 10, and deduce in essence
the plot of the whole story: that the two red bags (on page 6!) got switched,
and that the Babar family had been running all over the city trying to find
the Mustache-man who they thought would have Babar's crown (most recently
seen on page 3)? Could he or she figure from pages 60-63 that it was not
the family's flute, and therefore not their bag, and that therefore someone
else must have had their bag?

A case which appears only slightly less difficult perceptually is that
of pages 10-13, where the text indicates that Babar has lost his crown.
Could a person infer from Celeste's surprise (granting that he or she would
interpret that much) on page 10, and Babar's sadness on pages 12-13 (the flute is
not shown; the red bag is closed; otherwise the scene is the same as pages
10-11), that Babar was sad because his crown was not in the bag? To do so,
he or she would have to remember Babar packing his crown, which was shown, but
not very saliently, on page 3.

Likewise, the content of sentence 87 on pages 34-35

87. "He is not the Mustache-man at all!"

would be retrievable from the illustrations of the Babar family surrounding
a clean-shaven man, if the picture-reader knew that the Babar family had
been looking for a mustachioed man.^ He might have inferred this from the
sequence of illustrations on pages 24-27. These show (pages 24-25) Arthur
climbing out of the canal/river (the Seine actually), and walking in the
direction of a man with a mustache and a red bag who appears to be about to
board a bus. The Mustache-man and the bus make up about 1/8 of the
illustration. Pages 26-27 show Arthur running after a bus which is carrying
the/a mustachioed man. The red bag is not shown. Is Arthur's running after
this man enough to make the chase the central issue in the succeeding pages
for the picture-reader? (The/a mustachioed man, who first appears on pages
4-5, appears again before this scene, on page 21, where he is one of three
background figures, taking up, himself, perhaps 1/200 of the illustration.)
It is not clear from the illustration (though perhaps it is supposed to be)
that the elephants, who are in a boat, see him and his red suitcase, and that
this prompts Arthur to jump off the boat to get to him (pages 22-23).

c. Additional Kinds of Text-Illustration Relations

1) Text which functions as description of accompanying illustration.
A number of sentences in BABAR, and other Babar books, can be considered
grammatically correct only if construed as being descriptions of the
accompanying illustration of the sort that might be provided by someone
interpreting for a small child a story in a picture-book with no text. Examples
include sentences 3, 7, 11, 38, 68, 71, 110, and 126:

3. Here are the children--Pom, Flora and Alexander. (Pages 2-3)

4. Here are Cousin Arthur and his friend Zephir, the monkey. (Pages 2-3)

7. Now the Babar family is on the train. (Pages 4-5)

11. Now they are off the train. (Pages 6-7)

38. Now they are up in the Eiffel Tower. (Pages 16-17)

68. But now the man is on a bus. (Pages 26-27)

71. Arthur is standing in the sun to dry out his clothes. (Pages 28-29)

110. But now a red light. (Pages 42-43)

126. Now the whole Babar family chases the man with the bag. (Pages

These often are in the present progressive, like sentence 71, or contain
certain deictic elements which can be interpreted only as referring to the
illustrations. In all of these cases, the illustration depicts the situation
described in the now-sentence. If it did not, such narrative now-sentences
could only be grammatically construed as equivalent to then-statements
if the narrative is taken to be in the "historical present" which
is used when the events described are particularly real and vivid to the
narrator, and the sequence of events is very fast-paced. This construal of
these sentences can be ruled out here with some certainty, as the sequence
of events unfolds rather slowly, and there is no evidence that the narrator
is at all personally involved in any of the events recounted. His presence
is inferable only from the sentences which describe the illustrations, and
the few evaluative remarks found in such sentences as 115 and 160.

115. Poor Babar!

160. It turns out to be a great night after all.

In each case, the now also marks a change of scene in the story, sometimes
with a considerable gap in time, and a gap in the text which the reader has
to fill in by inference; for example, on pages 2-3, sentences 2 and 7:

2. Everyone is packing bags.

7. Now the Babar family is on the train.

Nothing was said about how they got on the train, or what they did between
packing and getting on the train. Likewise, with reference to sentences 36-38
on pages 14-17, it must be inferred that the family in fact decided to go
look for the Mustache-man in the Eiffel Tower.

36. "He may be in the Eiffel Tower," says Celeste.

37. "All visitors to Paris go up there."

38. Now they are up in the Eiffel Tower.

2) Supplementary information in illustrations. There are no major
parts of the text which are incomprehensible without the illustrations. The
illustrations seem to show a general picture of what is happening, each
two-page illustration depicting a new scene in the action recounted by the story.
Thus, the main function of the illustrations, besides a decorative one, seems
to be to give an actual visual representation of the individuals, objects,
and events referred to in the story. Thus, they might help the child in
understanding such new notions as "Eiffel Tower," "hotel," and "sidewalk
restaurant," which might not be familiar to him or her. The whole situation of
chasing the Mustache-man down the stairs to the subway and being stopped by
the gates at the bottom of the stairs might be almost incomprehensible to a
child not familiar with the French Metro system. The illustrations here
supplement the text by showing parts of these events. In this case neither
the illustration alone nor the text alone will suffice to provide enough
information for an untravelled six- or seven-year-old to fully comprehend
the situation described; only if a child relates them to each other will he
or she be able to make inferences from the pictures to the text, and
vice versa, to make sense of what is happening.

The same kind of text-illustration relation is found on the last two
pages of the story (pages 62-63). On pages 60-61, Babar recovers his crown
after bumping into the Mustache-man, who gets his flute back. Pages 62-63
contain the following text:

160. It turns out to be a great night after all.

161. The crown is on the head of the King . . . and the flute is under
the Mustache-man's mustache.

The reader would have a difficult time inferring from the text alone that
Babar and Celeste did get to the opera after all, since the only indication
of this in the text is the mention of the flute being under the Mustache-man's
mustache. Inferences are required along the following lines: If the flute
is under his mustache, perhaps it is in his mouth, since the mouth is under
the mustache. If it is in his mouth, he is probably playing it. Perhaps he
is in the orchestra at the opera. Since the location of the crown with
respect to the King (Babar) is mentioned in the same sentence, perhaps the King
is at the opera too. This last is perhaps still an improbable inference, but
enhanced nevertheless by the fact that Babar and Celeste were on their way to
the opera. In principle, of course, Babar could have decided to have a "great
night" at a nightclub to celebrate the recovery of his crown. The illustration
suggests the correct inferences by showing Babar with his crown sitting
in a box at the opera and a mustachioed flautist in the orchestra pit with the
rest of the orchestra, and this helps the reader make a little more sense of
the last sentence of the story.

d. Further research

There are perhaps more problems in content and inference analysis of
text and accompanying illustration which further work will identify. Certain
relatively naturalistic experiments could perhaps be devised to answer some
of the questions raised above, but it is not clear how much could be
generalized from the answers.

If it is possible to devise procedures which will permit the comparison
of illustrated texts with respect to the redundancy of accompanying
illustrations, then experiments can be designed to show what properties of
illustrated texts aid in the acquisition of reading ability, and under what
conditions, and what properties retard or discourage it.

B. Layout

1. Introduction

Descriptions of the layout of a text should include mention of at least
the following properties which may contribute to the readability of the text:

1. Presence of illustrations

2. Relative portions of the page occupied by text, illustrations,
and "white space"

3. Margins (right justified or not)

4. Size of type

5. Some measure of the number of lines, words per page

6. Paragraphing

7. Front matter (title page, background, etc.)

8. Back matter (index, comprehension questions, etc.)






We discuss here only one of the most interesting and potentially
important aspects of layout: the effect of line breaks on readability. It is
likely that the interruption at line ends can cause an interruption in
language processing. Experimentation is needed to determine whether the exact
placement of line ends (their coincidence, or lack of it, with crucial points
in syntactic and semantic structure, for instance) is an important factor in
readability. Most likely any effects would be most pronounced in beginning
readers.

Here we explore the correlations between line ends and syntactic
structure, and between line ends and certain aspects of the structure of direct
quotations. Anyone who has read much modern poetry has noticed that the
selection of line ends can be exploited for effects that vary from poetry
to gimmickry. It is clear that certain combinations of line end and
syntactic properties can have a jarring effect on the mature reader. It may be
that this effect is the symptom of some hang-up in syntactic processing
caused by the line end interruption, which could obstruct normal processing
and comprehension in the immature reader. These judgments are intuitive and
impressionistic; experimental work is clearly needed to determine the
importance of this factor.

An analogous question arises in regard to the relation between line end
interruptions and other kinds of linguistic properties. We have taken as a
case in point the relation between direct quotation and line ends; it is
conceivable that the processing task of keeping track of direct quotations,
their contents, and associated speakers is made more difficult by the
interruption of line ends.

2. Syntactic Description of Line

Probably because of short line length, many sentences were broken at
line ends. In the Analysis of Babar (Tech. Rep. No. 000), broken sentences
are listed with surface-structure parsings, with end-of-line positions marked.
The depth in the tree representing the surface syntactic structure of
the sentence at which the break occurred is indicated. We suspect that the
relation between line breaks and various syntactic properties is significant
for beginning readers, in that the line break is likely to cause an
interruption in syntactic (and other) processing. The analyst recorded
impressionistic judgments of the jarring effect of line breaks, on a scale of 1
to 7. This rough guess of level of difficulty does not entirely correlate
with depth. This is a matter that deserves detailed linguistic and
experimental investigation.
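
One way to make "depth of the break in the surface tree" concrete is sketched below. This is an illustration under stated assumptions, not the notation or procedure of the technical report. It assumes a labeled-bracket parse written as a string, a `|` character marking the line end, and depth defined as the number of constituents still open at the break. The function name `break_depths`, the example parse, and the position of the line break are all hypothetical.

```python
def break_depths(parse, marker="|"):
    """Return the bracket-nesting depth at each line-break marker in a parse string."""
    depth = 0
    depths = []
    for ch in parse:
        if ch == "[":
            depth += 1  # a constituent opens
        elif ch == "]":
            depth -= 1  # a constituent closes
            if depth < 0:
                raise ValueError("unbalanced brackets in parse")
        elif ch == marker:
            depths.append(depth)  # constituents still open at the line end
    if depth != 0:
        raise ValueError("unbalanced brackets in parse")
    return depths


# Hypothetical parse of BABAR sentence 12, with an assumed line end after "waiting".
parse = "[S [NP The Babar family] [VP is waiting | [PP for a taxi]]]"
print(break_depths(parse))  # [2]
```

Depths obtained this way could then be set beside the analyst's 1-to-7 judgments to examine how closely the two agree.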

3. Quotations and Line Ends

A matter analogous to the relation between syntax and line ends is the
role of line ends and other interruptions in the presentation of direct
quotation; the reader's task is simultaneously to keep track of who is speaking
and to process and comprehend what is said. This complex task can possibly
become more difficult when a single monologue by the same speaker is broken
by punctuation, syntax (e.g., Quote-Niching, as in sentence [illegible]), or line
ends. Also, readers could easily be confused by unannounced speaker changes
as in the transition from Babar to Celeste in sentences 31 and 32 of BABAR.
As it turns out, in BABAR all such speaker-to-speaker transitions involve
either pre-announcement of new speaker (as in sentences 50-51) or separation
of the two speakers by a page break. The question of possible effects on
beginning readers deserves experimental examination.

Footnotes

^Detailed analyses of some aspects of BABAR (Babar Loses His Crown)
and DESERT ("The Wonderful Desert") are given in Reading Center Technical
Reports 169 and 170.

^For convenience, we shall often use the term "discourse entity" to
refer to any such individual, object, event, etc. evoked by the text.

They play a somewhat similar role to Karttunen's "discourse referents"
(1976). Our alternate terminology rests on a desire to keep "referents" a
separate technical term.

^See Nash-Webber (1978a) for a full explanation of the notation used
above. As using it here unmotivated and unexplained may lead to more
confusion than it clarifies, in the remainder of this section we will appeal
to the reader's intuitions about the results of applying RW-1.

^Most noun phrases are referential. Non-referential noun phrases
include predicate nominatives and appositives.

^We are not claiming that he or she consciously applies rule RW-1, an
obvious idealization, to some formal representation of that sentence, just
that his or her understanding of the sentence has the same effect as RW-1.

^A similar problem arises with sentence 39:

39. The man with the bag is not there.

Only, in the case of 39, there is nothing in any illustration to indicate
that they are looking for a man with a bag. Of course, this is consistent
with the illustration of pages 16-17, but so are an infinite number of other
content statements, many no more or less likely than this, e.g., Celeste's
crown is not there, President de Gaulle is not there, etc.




Appendix I: Hansel and Gretel 

From Grimm's Fairy Tales, translated by E. V. Lucas, L. Crane, and M. Edwards,
copyright (c) 1945 by Grosset and Dunlap, Inc., and used with permission.

Close to a large forest there lived a woodcutter with his wife and his 
two children. The boy was called Hansel and the girl Gretel. They were 
always very poor and had very little to live on. And at one time when there 
was famine In the land, he could no longer procure dally bread. 

One night when he lay in bed worrying over his troubles, he sighed and
said to his wife, "What is to become of us? How are we to feed our poor
children when we have nothing for ourselves?"

"I'll tell you what, husband," answered the woman. "Tomorrow morning
we will take the children out quite early into the thickest part of the
forest. We will light a fire and give each of them a piece of bread. Then
we will go to our work and leave them alone. They won't be able to find
their way back, and so we shall be rid of them."

"Nay, wife," said the man, "we won't do that. I could never find it in
my heart to leave my children alone in the forest. The wild animals would
soon tear them to pieces."

"What a fool you are!" she said. "Then we must all four die of hunger.
You may as well plane the boards for our coffins at once."

She gave him no peace till he consented. "But I grieve over the poor
children all the same," said the man. The two children could not go to
sleep for hunger either, and they heard what their stepmother said to their
father.

Gretel wept bitterly and said, "All is over with us now."

"Be quiet, Gretel," said Hansel. "Don't cry! I will find some way out
of it."

When the old people had gone to sleep, he got up, put on his little coat,
opened the door, and slipped out. The moon was shining brightly and the white
pebbles round the house shone like newly minted coins. Hansel stooped down
and put as many into his pockets as they would hold.

Then he went back to Gretel and said, "Take comfort, little sister, and
go to sleep. God won't forsake us." And then he went to bed again.

At daybreak, before the sun had risen, the woman came and said, "Get up,
you lazybones! We are going into the forest to fetch wood."

Then she gave them each a piece of bread and said, "Here is something for
your dinner, but don't eat it before then, for you'll get no more."

Gretel put the bread under her apron, for Hansel had the stones in his 
pockets. Then they all started for the forest. 

When they had gone a little way, Hansel stopped and looked back at the 
cottage, and he did the same thing again and again. 

His father said, "Hansel, what are you stopping to look back at? Take
care and put your best foot foremost."

"Oh, father," said Hansel, "I am looking at my white cat. It is sitting
on the roof, wanting to say good-by to me."

"Little fool, that's no cat! It's the morning sun shining on the chimney,"
said the mother.

But Hansel had not been looking at the cat. He had been dropping a 
pebble on the ground each time he stopped. 

When they reached the middle of the forest, their father said, "Now,
children, pick up some wood. I want to make a fire to warm you."

Hansel and Gretel gathered the twigs together and soon made a huge pile.
Then the pile was lighted, and when it blazed up the woman said, "Now lie
down by the fire and rest yourselves while we go and cut wood. When we have
finished we will come back to fetch you."

Hansel and Gretel sat by the fire, and when dinnertime came they each
ate their little bit of bread, and they thought their father was quite near
because they could hear the sound of an ax. It was no ax, however, but a
branch which the man had tied to a dead tree, and which blew backwards and
forwards against it. They sat there so long a time that they got tired.
Then their eyes began to close and they were soon fast asleep.

When they woke it was dark night. Gretel began to cry, "How shall we
ever get out of the wood?"

But Hansel comforted her and said, "Wait a little while till the moon
rises, and then we will soon find our way."

When the full moon rose, Hansel took his little sister's hand and they
walked on, guided by the pebbles, which glittered like newly coined money.
They walked the whole night, and at daybreak they found themselves back at
their father's cottage.

They knocked at the door, and when the woman opened it and saw Hansel
and Gretel she said, "You bad children, why did you sleep so long in the
wood? We thought you did not mean to come back any more."

But their father was delighted, for it had gone to his heart to leave
them behind alone.

Appendix II: The Dog and the Cock

From The Winston Readers: First Reader by S. G. Firman, copyright (c) 1918,
Holt, Rinehart, and Winston.

Once a dog and a cock went into the woods.
Soon it grew dark.
The cock said, "Let us stay here all night. I will sleep in this tree-top.
You can sleep in the hollow trunk."
"Very well," said the dog.
So the dog and the cock went to sleep.

In the morning the cock began to crow, "Cock-a-doodle-do! Cock-a-doodle-do!"

Mr. Fox heard him crow. 

He said, "That is a cock crowing. He must be lost in the woods. I will
eat him for my breakfast."

Soon Mr. Fox saw the cock in the tree-top.

He said to himself, "Ha! ha! Ha! ha! What a fine breakfast I shall have!
I must make him come down from the tree. Ha! ha! Ha! ha!"

So he said to the cock, "What a fine cock you are! How well you sing!
Will you come to my house for breakfast?"

The cock said, "Yes, thank you, I will come if my friend may come, too."

"Oh yes," said the fox. "I will ask your friend. Where is he?"

The cock said, "My friend is in this hollow tree. He is asleep. You 
must wake him." 

Mr. Fox said to himself, "Ha! ha! I shall have two cocks for my
breakfast!"



So he put his head into the hollow tree.

Then he said, "Will you come to my house for breakfast?" 

Out jumped the dog and caught Mr. Fox by the nose.